WO 01/23604 
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2) INFORMATION FOR SEQ ID NO: 1053 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1263 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R581 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1053 

20 CATACGGCCT ATTATACCGA GCAAGCGTTG CGTGATACCG TTGAAAAAAC 50 

CATTAAAAAC TGTTTGGATT TTGAAAGGAG ACAGGAGCAT GAATAGAATA 100 

AAAGTTGCAA TACTGTTTGG GGGTTGCTCA GAGGAGCATG ACGTATCGGT 150 

AAAATCTGCA ATAGAGATAG CCGCTAACAT TAATAAAGAA AAATACGAGC 200 

CGTTATACAT TGGAATTACG AAATCTGGTG TATGGAAAAT GTGCGAAAAA 250 

25 CCTTGCGCGG AATGGGAAAA CGACAATTGC TATTCAGCTG TACTCTCGCC 300 

GGATAAAAAA ATGCACGGAT TACTTGTTAA AAAGAACCAT GAATATGAAA 350 

TCAACCATGT TGATGTAGCA TTTTCAGCTT TGCATGGCAA GTCAGGTGAA 400 

GATGGATCCA TACAAGGTCT GTTTGAATTG TCCGGTATCC CTTTTGTAGG 450 

CTGCGATATT CAAAGCTCAG CAATTTGTAT GGACAAATCG TTGACATACA 500 

3 0 TCGTTGCGAA AAATGCTGGG ATAGCTACTC CCGCCTTTTG GGTTATTAAT 550 

AAAGATGATA GGCCGGTGGC AGCTACGTTT ACCTATCCTG TTTTTGTTAA 600 

GCCGGCGCGT TCAGGCTCAT CCTTCGGTGT GAAAAAAGTC AATAGCGCGG 650 

ACGAATTGGA CTACGCAATT GAATCGGCAA GACAATATGA CAGCAAAATC 700 

TTAATTGAG.C AGGCTGTTTC GGGCTGTGAG GTCGGTTGTG CGGTATTGGG 750 

3 5 AAACAGTGCC GCGTTAGCTG TTGGCGAGGT GGACCAAATC AGGCTGCAGT 800 

ACGGAATCTT TCGTATTCAT CAGGAAGTCG AGCCGGAAAA AGGCTCTGAA 850 

AACGCAGTTA TAACCGTTCC CGCAGACCTT TCAGCAGAGG AGCGAGGACG 900 

GATACAGGAA ACGGCAAAAA AAATATATAA AGCGCTCGGC TGTAGAGGTC 950 

TAGCCCGTGT GGATATGTTT TTACAAGATA ACGGCCGCAT TGTACTGAAC 1000 

4 0 GAAGTCAATA CTCTGCCCGG TTTCACGTCA TACAGTCGTT ATCCCCGTAT 1050 

GATGGCCGCT GCAGGTATTG CACTTCCCGA ACTGATTGAC CGCTTGATCG 1100 

TATTAGCGTT AAAGGGGTGA TAAGCATGGA AATAGGATTT ACTTTTTTAG 1150 

ATGAAATAGT ACACGGTGTT CGTTGGGACG CTAAATATGC CACTTGGGAT 1200 

AATTTCACCG GAAAACCGGT TGACGGGTAT GAAAGTAAAT CGCATTGTAG 1250 

4 5 GGACATTCGA ATT 1263 



2) INFORMATION FOR SEQ ID NO: 1054 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1232 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
60 (A) ORGANISM: Enterococcus faecalis 

596 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(B) STRAIN: R610 



PCT/CA00/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1054 

5 CGGCCTATTA TNCCGAGCAA GCGTTGCGTG ATACCGTTGA AAAAACCATT 50 

AAAAACTGTT TGGATTTTGA AAGGAGACAG GAGCATGAAT AGAATAAAAG 100 

TTGCAATACT GTTTGGGGGT TGCTCAGAGG AGCATGACGT ATCGGTAAAA 150 

TCTGCAATAG AGATAGCCGC TAACATTAAT AAAGAAAAAT ACGAGCCGTT 2 00 

ATACATTGGA ATTACGAAAT CTGGTGTATG GAAAATGTGC GAAAAACCTT 250 

10 GCGCGGAATG GGAAAACGAC AATTGCTATT CAGCTGTACT CTCGCCGGAT 300 

AAAAAAATGC ACGGATTACT TGTTAAAAAG AACCATGAAT ATGAAATCAA 350 

CCATGTTGAT GTAGCATTTT CAGCTTTGCA TGGCAAGTCA GGTGAAGATG 400 

GATCCATACA AGGTCTGTTT GAATTGTCCG GTATCCCTTT TGTAGGCTGC 450 

GATATTCAAA GCTCAGCAAT TTGTATGGAC AAATCGTTGA CATACATCGT 500 

15 TGCGAAAAAT GCTGGGATAG CTACTCCCGC CTTTTGGGTT ATTAATAAAG 550 

ATGATAGGCC GGTGGCAGCT ACGTTTACCT ATCCTGTTTT TGTTAAGCCG 600 

GCGCGTTCAG GCTCATCCTT CGGTGTGAAA AAAGTCAATA GCGCGGACGA 650 

ATTGGACTAC GCAATTGAAT CGGCAAGACA ATATGACAGC AAAATCTTAA 700 

TTGAGCAGGC TGTTTCGGGC TGTGAGGTCG GTTGTGCGGT ATTGGGAAAC 750 

20 AGTGCCGCGT TAGTTGTTGG CGAGGTGGAC CAAATCAGGC TGCAGTACGG 800 

AATCTTTCGT ATTCATCAGG AAGTCGAGCC GGAAAAAGGC TCTGAAAACG 850 

CAGTTATAAC CGTTCCCGCA GACCTTTCAG CAGAGGAGCG AGGACGGATA 900 

CAGGAAACGG CAAAAAAAAT ATATAAAGCG CTCGGCTGTA GAGGTCTAGC 950 

CCGTGTGGAT ATGTTTTTAC AAGATAACGG CCGCATTGTA CTGAACGAAG 1000 

25 TCAATACTCT GCCCGGTTTC ACGTCATACA GTCGTTATCC CCGTATGATG 1050 

GCCGCTGCAG GTATTGCACT TCCCGAACTG ATTGACCGCT TGATCGTATT 1100 

AGCGTTAAAG GGGTGATAAG CATGGAAATA GGATTTACTT TTTTAGATGA 1150 

AATAGTACAC GGTGTTCGTT GGGACGCTAA ATATGCCACT TGGGATAATT 1200 

TCACCGGAAA ACCGGTTGAC GGTTATAAGT AA 1232 

30 



2) INFORMATION FOR SEQ ID NO: 1055 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1218 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 
45 (B) STRAIN: R684 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1055 

TACCGAGCAA GCGTTGCGTG ATACCGTTGA AAAAACCATT AAAAACTGTT 50 

50 TGGATTTTGA AAGGAGACAG GAGCATGAAT AGAATAAAAG TTGCAATACT 100 

GTTTGGGGGT TGCTCAGAGG AGCATGACGT ATCGGTAAAA TCTGCAATAG 150 

AGATAGCCGC TAACATTAAT AAAGAAAAAT ACGAGCCGTT ATACATTGGA 200 

ATTACGAAAT CTGGTGTATG GAAAATGTGC GAAAAACCTT GCGCGGAATG 250 

GGAAAACGAC AATTGCTATT CAGCTGTACT CTCGCCGGAT AAAAAAATGC 300 

55 ACGGATTACT TGTTAAAAAG AACCATGAAT ATGAAATCAA CCATGTTGAT 350 

GTAGCATTTT CAGCTTTGCA TGGCAAGTCA GGTGAAGATG GATCCATACA 400 

AGGTCTGTTT GAATTGTCCG GTATCCCTTT TGTAGGCTGC GATATTCAAA 4 50 

GCTCAGCAAT TTGTATGGAC AAATCGTTGA CATACATCGT TGCGAAAAAT 500 

GCTGGGATAG CTACTCCCGC CTTTTGGGTT ATTAATAAAG ATGATAGGCC 550 

60 GGTGGCAGCT ACGTTTACCT ATCCTGTTTT TGTTAAGCCG GCGCGTTCAG 600 

597' 



SUBSTITUTE SHEET (RULE 26) 
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GCTCATCCTT CGGTGTGAAA AAAGTCAATA GCGCGGACGA ATTGGACTAC 650 

GCAATTGAAT CGGCAAGACA ATATGACAGC AAAATCTTAA TTGAGCAGGC 700 

TGTTTCGGGC TGTGAGGTCG GTTGTGCGGT ATTGGGAAAC AGTGCCGCGT 750 

TAGCTGTTGG CGAGGTGGAC CAAATCAGGC TGCAGTACGG AATCTTTCGT 800 

5 ATTCATCAGG AAGTCGAGCC GGAAAAAGGC TCTGAAAACG CAGTTATAAC 850 

CGTTCCCGCA GACCTTTCAG CAGAGGAGCG AGGACGGATA CAGGAAACGG 900 

CAAAAAAAAT ATATAAAGCG CTCGGCTGTA GAGGTCTAGC CCGTGTGGAT 950 

ATGTTTTTAC AAGATAACGG CCGCATTGTA CTGAACGAAG TCAATACTCT 1000 

GCCCGGTTTC ACGTCATACA GTCGTTATCC CCGTATGATG GCCGCTGCAG 1050 

10 GTATTGCACT TCCCGAACTG ATTGACCGCT TGATCGTATT AGCGTTAAAG 1100 

GGGTGATAAG CATGGAAATA GGATTTACTT TTTTAGATGA AATAGTACAC 1150 

GGTGTTCGTT GGGACGCTAA ATATGCCACT TGGGATAATT TCACCGGAAA 1200 

ACCGGTTGAC GGTTAGAA 1218 

15 

2) INFORMATION FOR SEQ ID NO: 1056 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1265 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R688 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1056 



AATCACACCG CATACGGCCT ATTATACCGA GCAAGCGTTG CGTGATACCG 50 

TTGAAAAAAC CATTAAAAAC TGTTTGGATT TTGAAAGGAG ACAGGAGCAT 100 

35 GAATAGAATA AAAGTTGCAA TACTGTTTGG GGGTTGCTCA GAGGAGCATG 150 

ACGTATCGGT AAAATCTGCA ATAGAGATAG CCGCTAACAT TAATAAAGAA 200 

AAATACGAGC CGTTATACAT TGGAATTACG AAATCTGGTG TATGGAAAAT 250 

GTGCGAAAAA CCTTGCGCGG AATGGGAAAA CGACAATTGC TATTCAGCTG 300 

TACTCTCGCC GGATAAAAAA ATGCACGGAT TACTTGTTAA AAAGAACCAT 350 

40 GAATATGAAA TCAACCATGT TGATGTAGCA TTTTCAGCTT TGCATGGCAA 400 

GTCAGGTGAA GATGGATCCA TACAAGGTCT GTTTGAATTG TCCGGTATCC 4 50 

CTTTTGTAGG CTGCGATATT CAAAGCTCAG CAATTTGTAT GGACAAATCG 500 

TTGACATACA TCGTTGCGAA AAATGCTGGG ATAGCTACTC CCGCCTTTTG 550 

GGTTATTAAT AAAGATGATA GGCCGGTGGC AGCTACGTTT ACCTATCCTG 600 

45 TTTTTGTTAA GCCGGCGCGT TCAGGCTCAT CCTTCGGTGT GAAAAAAGTC 650 

AATAGCGCGG ACGAATTGGA CTACGCAATT GAATCGGCAA GACAATATGA 700 

CAGCAAAATC TTAATTGAGC AGGCTGTTTC GGGCTGTGAG GTCGGTTGTG 750 

CGGTATTGGG AAACAGTGCC GCGTTAGCTG TTGGCGAGGT GGACCAAATC 800 

AGGCTGCAGT ACGGAATCTT TCGTATTCAT CAGGAAGTCG AGCCGGAAAA 850 

50 AGGCTCTGAA AACGCAGTTA TAACCGTTCC CGCAGACCTT TCAGCAGAGG 900 

AGCGAGGACG GATACAGGAA ACGGCAAAAA AAATATATAA AGCGCTCGGC 950 

TGTAGAGGTC TAGCCCGTGT GGATATGTTT TTACAAGATA ACGGCCGCAT 1000 

TGTACTGAAC GAAGTCAATA CTCTGCCCGG TTTCACGTCA TACAGTCGTT 1050 

ATCCCCGTAT GATGGCCGCT GCAGGTATTG CACTTCCCGA ACTGATTGAC 1100 

55 CGCTTGATCG TATTAGCGTT AAAGGGGTGA TAAGCATGGA AATAGGATTT 1150 

ACTTTTTTAG ATGAAATAGT ACACGGTGTT CGTTGGGACG CTAAATATGC 1200 

CACTTGGGAT AATTTCACCG GAAAACCGGT TGACGGTTAT GAAGTAAATC 1250 

GCATTGTAGG GACAT 1265 

60 

598 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 1057 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 12 69 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 

(B) STRAIN: R689 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1057 



TCACACCGCA TACGGCCTAT TATACCGAGC AAGCGTTGCG TGATACCGTT 50 

GAAAAAACCA TTAAAAACTG TTTGGATTTT GAAAGGAGAC AGGAGCATGA 100 

20 ATAGAATAAA AGTTGCAATA CTGTTTGGGG GTTGCTCAGA GGAGCATGAC 150 

GTATCGGTAA AATCTGCAAT AGAGATAGCC GCTAACATTA ATAAAGAAAA 200 

ATACGAGCCG TTATACATTG GAATTACGAA ATCTGGTGTA TGGAAAATGT 250 

GCGAAAAACC TTGCGCGGAA TGGGAAAACG ACAATTGCTA TTCAGCTGTA 300 

CTCTCGCCGG ATAAAAAAAT GCACGGATTA CTTGTTAAAA AGAACCATGA 350 

25 ATATGAAATC AACCATGTTG ATGTAGCATT TTCAGCTTTG CATGGCAAGT 400 

CAGGTGAAGA TGGATCCATA CAAGGTCTGT TTGAATTGTC CGGTATCCCT 450 

TTTGTAGGCT GCGATATTCA AAGCTCAGCA ATTTGTATGG ACAAATCGTT 500 

GACATACATC GTTGCGAAAA ATGCTGGGAT AGCTACTCCC GCCTTTTGGG 550 

TTATTAATAA AGATGATAGG CCGGTGGCAG CTACGTTTAC CTATCCTGTT 600 

30 TTTGTTAAGC CGGCGCGTTC AGGCTCATCC TTCGGTGTGA AAAAAGTCAA 650 

TAGCGCGGAC GAATTGGACT ACGCAATTGA ATCGGCAAGA CAATATGACA 700 

GCAAAATCTT AATTGAGCAG GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG 750 

GTATTGGGAA ACAGTGCCGC GTTAGCTGTT GGCGAGGTGG ACCAAATCAG 800 

GCTGCAGTAC GGAATCTTTC GTATTCATCA GGAAGTCGAG CCGGAAAAAG 850 

35 GCTCTGAAAA CGCAGTTATA ACCGTTCCCG CAGACCTTTC AGCAGAGGAG 900 

CGAGGACGGA TACAGGAAAC GGCAAAAAAA ATATATAAAG CGCTCGGCTG 950 

TAGAGGTCTA GCCCGTGTGG ATATGTTTTT ACAAGATAAC GGCCGCATTG 1000 

TACTGAACGA AGTCAATACT CTGCCCGGTT TCACGTCATA CAGTCGTTAT 1050 

CCCCGTATGA TGGCCGCTGC AGGTATTGCA CTTCCCGAAC TGATTGACCG 1100 

40 CTTGATCGTA TTAGCGTTAA AGGGGTGATA AGCATGGAAA TAGGATTTAC 1150 

TTTTTTAGAT GAAATAGTAC ACGGTGTTCG TTGGGACGCT AAATATGCCA 1200 

CTTGGGATAA TTTCACCGGA AAACCGGTTG ACGGTTATGA AGTAAATCGC 1250 

ATTGTAGGGA CATACGAAT 1269 

45 

2 ) INFORMATION FOR SEQ ID NO: 1058 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 1169 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R420 

60 

599 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1058 

CAAATTTTCT TTTCTTTTCC TAGGTACACT GAATGTAACC TTAAAAGAAA 50 

AAAGGAAAGG AAGAAAATGA TGAAAAAAAT TGCCGTTTTA TTTGGAGGGA 100 

5 ATTCTCCAGA ATACTCAGTG TCACTAACCT CAGCAGCAAG TGTGATCCAA 150 

GCTATTAACC CGCTGAAATA TGAAGTAATG ACCATTGGCA TCGCACCAAC 200 

AATGGATTGG TATTGGTATC AAGGAAACCT CGCGAATGTT CGCAATGATA 250 

CTTGGCTAGA AGATCACAAA AACTGTCACC AGCTGACTTT TTCTAGCCAA 3 00 

GGATTTATAT TAGGAGAAAA ACGAATCGTC CCTGATGTCC TCTTTCCAGT 350 

10 CTTGCATGGG AAGTATGGCG AGGATGGCTG TATCCAAGGA CTGCTTGAAC 4 00 

TAATGAACCT GCCTTATGTT GGTTGCCATG TCGCTGCCTC CGCATTATGT 450 

ATGAACAAAT GGCTCTTGCA TCAACTTGCT GATACCATGG GAATCGCTAG 500 

TGCTCCCACT TTGCTTTTAT CCCGCTATGA AAACGATCCT GCCACAATCG 550 

ATCGTTTTAT TCAAGAC CAT GGATTCCCGA TCTTTATCAA GCCGAATGAA 600 

15 GCCGGTTCTT CAAAAGGGAT CACAAAAGTA AC TGAC AAAA CAGCGCTCCA 650 

ATCTGCATTA ACGACTGCTT TTGCTTACGG TTCTACTGTG TTGATCCAAA 700 

AGGCGATAGC GGGTATTGAA ATTGGCTGCG GCATCTTAGG AAATGAGCAA 750 

TTGACGATTG GTGCTTGTGA TGCGATTTCT CTTGTCGACG GTTTTTTTGA 800 

TTTTGAAGAG AAATACCAAT TAATCAGCGC CACGATCACT GTCCCAGCAC 850 

20 CATTGCCTCT CGCGCTTGAA TCACAGATCA AGGAGCAGGC ACAGCTGCTT 900 

TATCGAAACT TGGGATTGAC GGGTCTGGCT CGAATCGATT TTTTCGTCAC 950 

CAATCAAGGA GCGATTTATT TAAACGAAAT CAACACCATG CCGGGATTTA 1000 

CTGGGCACTC CCGCTACCCA GCTATGATGG CGGAAGTCGG GTTATCCTAC 1050 

GAAATATTAG TAGAGCAATT GATTGCACTG GCAGAGGAGG ACAAACGATG 1100 

25 AACACATTAC AATTGATCAA TAAAAACCAT CCATTGAAAA AAAATCAAGA 1150 

GCCCCCGCAC TTAGTGCTA 1169 



30 2) INFORMATION FOR SEQ ID NO: 1059 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1166 bases 1 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R631 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1059 



CAAATTTTCT TTTCTTTTCC TAGGTACACT GAATGTAACC TTAAAAGAAA 50 

AAAGGAAAGG AAGAAAATGA TGAAAAAAAT TGCCGTTTTA TTTGGAGGGA 100 

ATTCTCCAGA ATACTCAGTG TCACTAGCCT CAGCAGCAAG TGTGATCCAA 150 

GCTATTGACC CGCTGAAATA TGAAGTAATG ACCATTGGCA TCGCACCAAC 200 

50 AATGGATTGG TATTGGTATC AAGGAAACCT CGCGAATGTT CGCAATGATA 250 

CTTGGCTAGA AGATCACAAA AACTGTCACC AGCTGACTTT TTCTAGCCAA 300 

GGATTTATAT TAGGAGAAAA ACGAATCGTC CCTGATGTCC TCTTTCCAGT 350 

CTTGCATGGG AAGTATGGCG AGGATGGCTG TATCCAAGGA CTGCTTGAAC 400 

TAATGAACCT GCCTTATGTT GGTTGCCATG TCGCTGCCTC CGCATTATGT 450 

55 ATGAACAAAT GGCTCTTGCA TCAACTTGCT GATACCATGG GAATCGCTAG 500 

TGCTCCCACT TTGCTTTTAT CCCGCTATGA AAACGATCCT GCCACAATCG 550 

ATCGTTTTAT TCAAGACCAT GGATTCCCGA TCTTTATCAA GCCGAATGAA 600 

GCCGGTTCTT CAAAAGGGAT CACAAAAGTA ACTGACAAAA CAGCGCTCCA 650 

ATCTGCATTA ACGACTGCTT TTGCTTACGG TTCTACTGTG TTGATCCAAA 700 

60 AGGCGATAGC GGGTATTGAA ATTGGCTGCG GCATCTTAGG AAATGAGCAA 750 

600 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

TTGACGATTG GTGCTTGTGA TGCGATTTCT 
TTTTGAAGAG AAATACCAAT TAATCAGCGC 
CATTGCCTCT CGCGCTTGAA TCACAGATCA 
TATCGAAACT TGGGATTGAC GGGTCTGGCT 
5 CAATCAAGGA GCGATTTATT TAAACGAAAT 
CTGGGCACTC CCGCTACCCA GCTATGATGG 
GAAATATTAG TAGAGCAATT GATTGCACTG 
AACACATTAC AATTGATCAA TAAAAACCAT 
GCCCCCGCAC TTAGTG 



PCT/CAOO/01150 

CTTGTCGACG GTTTTTTTGA 800 

CACGATCACT GTCCCAGCAC 850 

AGGAGCAGGC ACAGCTGCTT 900 

CGAATCGATT TTTTCGTCAC 950 

CAACACCATG CCGGGATTTA 1000 

CGGAAGTCGG GTTATCCTAC 1050 

GCAGAGGAGG ACAAACGATG 1100 

CCATTGAAAA AAAATCAAGA 1150 

1166 



2) INFORMATION FOR SEQ ID NO: 1060 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 102 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 
25 (B) STRAIN: ATCC 25788 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1060 

AACATGAAAA AAATCGCCTT ATTTTTGGAG GCAATTCACC GGAATACACC 50 

30 GTTTCTTTAG CTTCAGCAAC TAGCGCAATC GAAGCACTCC AATCATCTCC 100 

CTATGACTAC GACCTCTCTT TGATCGGGAT CGCCCCAGAT GCTATGGATT 150 

GGTACTTGTA TACAGGAGAA CTGGAAAACA TCCGACAAGA CACGTGGTTG 2 00 

TTGGATACGA AACATAAACA GAAAATACAG CCGCTATTCG AAGGAAACGG 250 

CTTTTGGCTA AGTGAAGAGC AGCAAACGTT GGTACCTGAT GTTTTATTTC 3 00 

35 CCATTATGCA TGGCAAATAC GGGGAAGATG GCAGTATCCA AGGATTGTTT 350 

GAATTGATGA AGCTGCCTTA TGTAGGCTGC GGGGTGGCAG GTTCTGCCTT 400 

ATGTATGAAC AAATGGCTGC TGCATCAAGC TGCAGCAGCC ATTGGCGTAC 450 

AAAGTGCTCC TACGATTCTC TTGACAAATC AAGCCAACCA GCAAGAACAA 500 

ATCGAAGCTT TTATCCAGAC CCATGGCTTC CCAGTTTTCT TTAAGCCTAA 550 

40 TGAAGCGGGC TCCTCAAAAG GGATCACTAA AGTCACCTGC GTTGAAGAAA 600 

TCGCTTCTGC CTTAAAAGAA GCCTTTACTT ATTGTTCCGC AGTGCTCCTA 650 

CAAAAAAATA TTGCCGGTGT TGAGATCGGT TGCGGTATTT TGGGCAACGA 700 

CTCTTTGACT GTCGGTGCTT GTGACGCCAT TTCATTAGTA GACGGCTTTT 750 

TCGATTTTGA AGAAAAGTAC CAGCTGATCA GCGCCAAAAT CACCGTCCCT 800 

45 GCGCCATTGC CTGAAACGAT TGAAACCAAG GTCAAAGAAC AAGCTCAGCT 850 

GCTCTATCGT AGTCTTGGTC TTAAAGGTCT TGCTCGCATC GACTTTTTTG 900 

TCACGGAGCG AGGAGAACTA TACTTGAATG AAATCAATAC TATGCCGGGC 950 

TTTACGAGTC ACTCCCGCTA TCCTGCCATG ATGGCAGCGG TCGGCTTATC 1000 

CTATCAAGAA CTACTACAAA AACTGCTT 1028 

50 



2) INFORMATION FOR SEQ ID NO: 1061 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



60 



601 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1061 

AATATGAAAA AAATCGCCTT ATTTTTGGAG GCAATTCACC GGAGTACGCC 50 

GTTTCTTTAG CCTCAGCAAC TAGCGCAATC GAAGCACTCC AATCATCTCC 100 

CGATGACTAT GACCTCTCTT TGATCGGGAT CGCCCCAGAT GCTATGGATT 150 

GGTATTTGTA TACAGGAGAA CTGGAAAACA TCCGACAAGA CACGTGGTTG 200 

TTGGATACGA AACATAAACA GAAAATCCAG CCGCTTTTTG AAGGAAACGG 250 

CTTTTGGCTA AGTGAAGAGC AACAAACGTT GGTTCCTGAT GTTTTATTTC 300 

CCATTATGCA TGGCAAATAC GGGGAAGATG GCAGTATCCA AGGATTGTTT 350 

GAATTGATGA AACTACCTTA TGTAGGCTGC GGGGTGGCAG CCTCTGCCTT 400 

ATGTATGAAC AAATGGCTGC TGCATCAAGC AGCAGAAGCG ATTGGCGTAC 450 

AAAGTGCTCC TACGATTCTC TTGACAAATC AAGCCAACCA GCAAGATCAA 500 

ATCGAAGCTT TTATCCAGAC CCATGGCTTT CCGGTTTTTT TTAAGCCTAA 550 

TGAAGCGGGC TCCTCAAAAG GGATCACTAA AGTCACCTGC GTTGAAGAAA 600 

TCGCTTCTGC CTTAAAAGAA GCCTTTACTT ATTGTTCAGC AGTGCTCCTA 650 

CAAAAAAATA TTGCCGGTGT TGAGATCGGT TGCGGTATTT TGGGCAACGA 700 

CTCTTTGACT GTCGGTGCTT GTGACGCTAT TTCATTAGTA GACGGCTTTT 750 

TCGATTTTGA AGAAAAGTAC CAGCTGATCA GCGC CAAGAT CACCGTTCCT 800 

GCACCATTGC CTGAAACGAT TGAAACCAAG GTCAAAGAAC AAGCTCAGCT 850 

GCTCTATCGT AGTCTTTGTC TTAAAGGTCT TGCTCGCATC GACTTTTTTG 900 

TCACGGATCA AGGAGAACTA TACTTGAATG AAATCAATAC TATGCCGGGC 950 

TTTACGAGTC ACTCCCGCTA TCCTGCCATG ATGGCAGCGA TCGGCTTATC 1000 

CTATCAAGAA CTACTACAAA AACTGCTTGT 1030 



2) INFORMATION FOR SEQ ID NO: 1062 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1031 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R754 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1062 

AAACATGAAA AAAATCGCCA TTATTTTTGG AGGCAATTCA CCGGAATACA 50 

CCGTTTCTTT AGCTTCAGCA ACTAGCGCAA TCGAAGCACT CCAATCATCT 100 

CCCTATGACT ACGACCTCTC TTTGATCGGG ATCGCCCCAG ATGCTATGGA 150 

TTGGTACTTG TATACAGGAG AACTGGAAAA CATCCGACAA GACACGTGGT 200 

TGTTGGATAC GAAACATAAA CAGAAAATAC AGCCGCTATT CGAAGGAAAC 250 

GGCTTTTGGC TAAGTGAAGA GCAGCAAACG TTGGTACCTG ATGTTTTATT 300 

TCCCATTATG CATGGCAAAT ACGGGGAAGA TGGCAGTATC CAAGGATTGT 350 

TTGAATTGAT GAAGCTGCCT TATGTAGGCT GCGGGGTGGC AGGTTCTGCC 4 00 

TTATGTATGA ACAAATGGCT GCTGCATCAA GCTGCAGCAG CCATTGGCGT 450 

ACAAAGTGCT CCTACGATTC TCTTGACAAA TCAAGCCAAC CAGCAAGAAC 500 

AAATCGAAGC TTTTATCCAG ACCCATGGCT TCCCAGTTTT CTTTAAGCCT 550 

AATGAAGCGG GCTCCTCAAA AGGGATCACT AAAGTCACCT GCGTTGAAGA 600 

602 
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AATCGCTTCT GCCTTAAAAG 
TACAAAAAAA TATTGCCGGT 
GACTCTTTGA CTGTCGGTGC 
TTTCGATTTT GAAGAAAAGT 
CTGCGCCATT GCCTGAAACG 
CTGCTCTATC GTAGTCTTGG 
TGTCACGGAG CGAGGAGAAC 
GCTTTACGAG TCACTCCCGC 
TCCTATCAAG AACTACTACA 



AAGCCTTTAC TTATTGTTCC GCAGTGCTCC 
GTTGAGATCG GTTGCGGTAT TTTGGGCAAC 
TTGTGACGCC ATTTCATTAG TAGACGGCTT 
ACCAGCTGAT CAGCGCCAAA ATCACCGTCC 
ATTGAAACCA AGGTCAAAGA ACAAGCTCAG 
TCTTAAAGGT CTTGCTCGCA TCGACTTTTT 
TATACTTGAA TGAAATCAAT ACTATGCCGG 
TATCCTGCCA TGATGGCAGC GGTCGGCTTA 
AAAACTGCTT G 
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2 ) INFORMATION FOR SEQ ID NO: 1063 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: R775 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1063 

AACATGAAAA AAATCGCCAT TATTTTTGGA GGCAATTCAC CGGAATACAC 50 

CGTTTCTTTA GCTTCAGCAA CTAGCGCAAT CGAAGCACTC CAATCATCTC 100 

CCTATGACTA' CGACCTCTCT TTGATCGGGA TCGCCCCAGA TGCTATGGAT 150 

TGGTACTTGT ATACAGGAGA ACTGGAAAAC ATCCGACAAG ACACGTGGTT 2 00 

GTTGGATACG ' AAAC ATAAAC AGAAAATACA GCCGCTATTT GAAGGAAACG 250 

GCTTTTGGCT AAGTGAAGAG CAGCAAACGT TAGTACCTGA TATTTTATTT 300 

CCCATTATGC ATGGCAAATA CGGGGAAGAT GGCAGTATCC AAGGATTGTT 350 

TGAATTGATG AAACTACCTT ATGTAGGTTG CGGGGTGGCA GGTTCTGCCT 400 

TATGTATGAA CAAATGGCTG CTGCATCAAG CTGCAGCAGC CATTGGCGTA 450 

CAAAGTGCTC CTACGATTCT CTTGACAAAT CAAGCCAACC AGCAAGAACA 500 

AATCGAAGCT TTTATCCAGA CCCATGGCTT CCCAGTTTTC TTTAAGCCTA 550 

ATGAAGCGGG CTCTTCAAAA GGGATCACTA AAGTCACCTG CGTTGAAGAA 600 

ATCGCTTCTG CCTTAAAAAA AGCCTTTACT TATTGTTCCG CAGTGCTCCT 650 

ACAAAAAAAT ATTGCCGGTG TTGAGATCGG TTGCGGTATT TTGGGCAACG 700 

ACTCTTTGAC TGTCGGTGCT TGTGACGCCA TTTCATTAGT AGACGGCTTT 750 

TTCGATTTTG AAGAAAAGTA CCAGCTGATC AGCGCCAAAA TCACCGTCCC 800 

TGCGCCATTG CCTGAAACGA TTGAAACCAA GGTCAAAGAA CAAGCTCAGC 850 

TGCTCTATCG TAGTCTTGGT CTTAAAGGTC TTGCTCGCAT CGACTTTTTT 900 

GTCACGGATC AAGGAGAACT ATACTTGAAT GAAATCAATA CTATGCCGGG 950 

CTTTACGAGT CACTCCCGTT ATCCTGCCAT GATGGCAGCG GTCGGCTTAT 1000 

CCTATCAAGA ACTACTACAA AAACTGCTTG 1030 



2) INFORMATION FOR SEQ ID NO: 1064 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



603 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
5 (B) STRAIN: ATCC 49996 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1064 

AAACATGAAA AAAATCGCCA TTATTTTTGG AGGCAATTCA CCGGAATACA 50 

L0 CCGTTTCTTT AGCTTCAGCA ACTAGCGCAA TCGAAGCACT CCAATCATCT 100 

CCCTATGACT ACGACCTCTC TTTGATCGGG ATCGCCCCAG ATGCTATGGA 150 

TTGGTACTTG TATACAGGAG AACTGGAAAA CATCCGACAA GACACGTGGT 2 00 

TGTTGGATAC GAAACAGAAA CAGAAAATAC AGCCGCTATT CGAAGGAAAC 250 

GGCTTTTGGT TAAGTGAAGA GCAGCAAACG TTGGTACCTG ATGTTTTATT 3 00 

L5 TCCCATTATG CATGGCAAAT ACGGGGAAGA TGGCAGTATC CAAGGATTGT * 350 

TTGAATTGAT GAAGCTACCT TATGTAGGCT GCGGGGTGGC AGGTTCTGCC 400 

TTATGTATGA ACAAATGGTT GCTGCATCAA GCTGCAGCAG CCATTGGCGT 450 

ACAAAGTGCT CCTACGATTC TCTTGACAAA TCACGCCAAC CAGCAAGAAC 500 

AAATCGAAGC TTTTATCCAG ACCCATGGCT TTCCAGTTTT CTTTAAGCCT 550 

20 AATGAAGCGG GTTCCTCAAA AGGGATCACT AAAGTCACCT GCGTTGAAGA 600 

AATCGCTTCT GCCTTAAAAG AAGCCTTTAC TTATTGTTCC GCAGTGCTCC 650 

TACAAAAAAA TATTGCCGGT GTTGAGATCG GTTGCGGTAT TTTGGGCAAC 700 

GACTCTTTGA CTGTCGGTGC TTGTGACGCC ATTTCATTAG TAGACGGCTT 750 

TTTCGATTTT GAAGAAAAGT ACCAGCTGAT CAGCGCCAAA ATCACCGTCC 800 

25 CTGCGCCATT GCCTGAAACG ATTGAAACTA AGGTCAAAGA ACAAGCTCAG 850 

CTGCTCTATC GTAGTCTTGG ACTTAAAGGT CTTGCTCGCA TCGACTTTTT 900 

TGTCACGGAT CAAGGAGAAC TATACTTGAA TGAAATCAAT ACTATGCCGG 950 

GCTTTACGAG TCACTCCCGC TATCCTGCCA TGATGGCAGC GGTCGGGTTA 1000 

TCCTATCAAG AACTACTACA AAAACTACTT GT 1032 

30 



2) INFORMATION FOR SEQ ID NO<: 1065 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1034 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
45 (B) STRAIN: R758 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1065 

AAAAACATGA AAAAAATCGC CATTATTTTT GGAGG CAATT CACCGGAATA 50 

50 CACCGTTTCT TTAGCTTCAG CAACTAGCGC AATCGAAGCA CTCCAATCAT 100 

CTCCCTATGA CTACGACCTC TCTTTGATCG GGATCGCCCC AGATGCTATG 150 

GATTGGTACT TGTATACAGG AGAACTGGAA AACATCCGAC AAGACACGTG 2 00 

GTTGTTGGAT ACGAAACATA AACAGAAAAT ACAGCCGCTA TTCGAAGGAA 250 

ACGGCTTTTG GCTAAGTGAA GAGCAGCAAA CGTTGGTACC TGATGTTTTA 300 

55 TTTCCCATTA TGCATGGCAA ATACGGGGAA GATGGCAGTA TCCAAGGATT 350 

GTTTGAATTG ATGAAGCTGC CTTATGTAGG CTGCGGGGTG GCAAGTTCTG 400 

CCTTATGTAT GAACAAATGG CTGCTGCATC AAGCTGCAGC AGCCATTGGC 450 

GTACAAAGTG CTCCTACGAT TCTCTTGACA AATCAAGCCA ACCAGCAAGA 500 

ACAAATCGAA GCTTTTATCC AGACCCATGG CTTTCCAGTT TTCTTTAAGC 550 

60 CTAATGAAGC GGGCTCCTCA AAAGGGATCA CTAAAGTCAC CTGCGTTGAA 600 

604 
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GAAATCGCTT CTGCCTTAAA AGAAGCCTTT 
CCTACAAAAA AATATTGCCG GTGTTGAGAT 
ACGACTCTTT GACTGTCGGT GCTTGTGACG 
TTTTTCGATT TTGAAGAAAA GTACCAGCTG 
5 CCCTGCGCCA TTGCCTGAAA CGATTGAAAC 
AGCTGCTCTA TCGTAGTCTT GGTCTTAAAG 
TTTGTCACGG ATCAAGGAGA ACTATACTTG 
GGGCTTTACG AGTCACTCCC GCTATCCTGC 
TATCCTATCA AGAACTACTA CAAAAACTGC 



PCT/CA00/01150 

ACTTATTGTT CCGCAGTGCT 650 

CGGTTGCGGT ATTTTGGGCA 700 

CCATTTCATT AGTAGACGGC 750 

ATCAGCGCCA AAATCACCGT 800 

CAAGGTCAAA GAACAAGCTC 850 

GTCTTGCTCG CATCGACTTT 900 

AATGAAATCA ATACTATGCC 950 

CATGATGGCA GCGGTCGGCT 1000 

TTGT 1034 



2) INFORMATION FOR SEQ ID NO: 1066 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1012 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus flavescens 
25 (B) STRAIN: R760 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1066 

CATGAAAAAA ATCGCCATTA TTTTTGGAGG CAATTCACCG GAATACACCG 50 

30 TTTCTTTAGC CTCAGCAACT AGCGCAATCG AAGCACTCCA ATCATCTCCC 100 

TATGATTACG ACCTCTCTTT GATCGGGATC GCCCCAGATG CTATGGATTG 150 

GTACTTGTAT ACAGGAGAAC TGGAAAACAT CCGACAAGAC ACGTGGTTGT 2 00 

TGGATACGAA ACATACACAG AAAATCCAGC CACTTTTTGA AGGAAACGGC 250 

TTTTGGATAA GTGAAGCGCA GCAAACGTTG GTACCTGATG TTTTATTTCC 300 

35 CATTATGCAT GGTAAATACG GGGAAGATGG CAGTATCCAA GGATTGTTTG 350 

AATTGATGAA GCTGCCTTAT GTAGGCTGTG GGGTGGCAGC CTCTGCCTTA 400 

TGTATGAACA AATGGTTATT GCATCAAGCA GCAGCAGCGA TTGGCGTACA 450 

AAGCGCTCCT ACGATTCTCT TGACAAATCA AGCCAACCAG CAAAGACAAA 500 

TCGAAGCCTT TATCCAGACC CATGGCTTTC CAGTTTTCTT TAAGCCTAAT 550 

40 GAAGCGGGCT CCTCAAAAGG GATCACAAAA GTAACTTGTG TTGAAGAAAT 600 

CGCTCCTGCC TTGAAGGAAG CCTTCGCTTA TTGTTCCGCA GTGCTCTTAC 650 

AAAAAAATAT CGCTGGCGTT GAGATTGGTT GCGGTATCTT AGGCAACGAC 700 

TCTTTGACTG TCGGTGCTTG TGACGCTATT TCATTAGTAG ACGGCTTTTT 750 

CGATTTTGAA GAAAAGTACC AGTTGATCAG CGCCAAAATC ACCGTTCCTG 800 

45 CGCCATTGCC TGAAACGATT GAAACCAAAG TCAAAGAACA AGCTCAGCTG 850 

CTCTATCACA GTCTTGGTCT TAAAGGACTT GCTCGCATCG ACTTTTTTGT 900 

CACGGATCAA GGAGAACTAT ACTTGAATGA AATCAATACT ATGCCGGGCT 950 

TTACGAGTCA CTCCCGCTAT CCTGCCATGA TGGCAGCGGT CGGCTTATCC 1000 

TATCAAGAAT TA 1012 

50 



2) INFORMATION FOR SEQ ID NO: 1067 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 721 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

60 

605 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
5 (B) STRAIN: R481 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1067 

CTTACGCTTT ATCGATTAGA CACGGGNAGC TTGTCCAATG GGRAGCCGAT 50 

10 TTGATTTTAT GGATGAACGC TCTCATCATG CGGCAAATGG AATATCATGC 100 

AATGAAGCGC AAAATCGCAG ACGTTTGCGC TCCATCATGG AAAACAGTGG 150 

GTTTGAAGCA TATAGCCTCG AATGGTGGCA CTATGTATTA AGAGACGAAC 2 00 

CATACCCCAA TAGCTATTTT GATTTCCCCG TTAAATAAAC TTTTAACCGT 2 50 

TGCACGGACA AACTATATAA GCTAACTCTT TCGGCAGGAA ACCCGACGTA 300 

15 TGTAACTGGT TCTTAGGGAA TTTATATATA GTAGATAGTA TTGAAGATGT 3 50 

AAGGCAGAGC GATATTGCGG TCATTATCTG CGTGCGCTGC GGCAAGATAG 4 00 

CCTGATAATA AGACTGATCG CATAGAGGGG TGGTATTTCA CACCGCCCAT 450 

TGTCAACAGG CAGTTCAGCC TCGTTAAATT CAGCATGGGT ATCACTTATG 500 

AAAATTCATC TACATTGGTG ATAATAGTAA ATCCAGTAGG GCGAAATAAT 550 

20 TGACTGTAAT TTACGGGGCA AAACGGCACA ATCTCAAACG AGATTGTGCC 600 

GTTTAAGGGG AAGATTCTAG AAATATTTCA TACTTCCAAC TATATAGTTA 650 

AGGAGGAGAC TGAAAATGAA GAAGTTGTTT TTTTTATTGT TATTGTTATT 700 

CTTAATATAC TTAGGTTATT G 721 



25 



40 



2) INFORMATION FOR SEQ ID NO: 1068 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 668 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R492 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1068 



ATTTTTAAGG ATGAACGCTC TTCATCATGC GGCAAATGGA ATATCATGCA 50 

ATGAAGCGCA AAATCGCAGA CGTTTGCGCT CCATCATGGA AAACAGTGGG 100 

45 TTTGAAGCAT ATAGCCTCGA ATGGTGGCAC TATGTATTAA GAGACGAACC 150 

ATACCCCAAT AGCTATTTTG ATTTCCCCGT TAAATAAACT TTTAACCGTT 200 

GCACGGACAA ACTATATAAG CTAACTCTTT CGGCAGGAAA CCCGACGTAT 250 

GTAACTGGTT CTTAGGGAAT TTATATATAG TAGATAGTAT TGAAGATGTA 300 

AGGCAGAGCG ATATTGCGGT CATTATCTGC GTGCGCTGCG GCAAGATAGC 350 

50 CTGATAATAA GACTGATCGC ATAGAGGGGT GGTATTTCAC ACCGCCCATT 400 

GTCAACAGGC AGTTCAGCCT CGTTAAATTC AGCATGGGTA TCACTTATGA 450 

AAATTCATCT ACATTGGTGA TAATAGTAAA TCCAGTAGGG CGAAATAATT 500 

GACTGTAATT TACGGGGCAA AACGGCACAA TCTCAAACGA GATTGTGCCG 550 

TTTAAGGGGA AGATTCTAGA AATATTTCAT ACTTCCAACT ATATAGTTAA 600 

55 GGAGGAGACT GAAAATGAAG AAGTTGTTTT TTTTATTGTT ATTGTTATTC 650 

TTAATATACT TAGGTTAT 668 



60 2) INFORMATION FOR SEQ ID NO: 1069 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 760 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R581 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1069 



CGGCAAGTGC CATTGATCTT ACGCTTTATC GATTAGACAC GGGTRAGCTT 50 

GTACCAATGG GAAG CCGATT TGATTTTATG GATGAACGCT CTCATCATGC 100 

GGCAAATGGA ATATCATGCA ATGAAGCGCA AAATCGCAGA CGTTTGCGCT 150 

CCATCATGGA AAACAGTGGG TTTGAAGCAT ATAGCCTCGA ATGGTGGCAC 200 

20 TATGTATTAA GAGACGAACC ATACCCCAAT AGCTATTTTG ATTTCCCCGT 250 

TAAATAAACT TTTAACCGTT GCACGGACAA ACTATATAAG CTAACTCTTT 300 

CGGCAGGAAA CCCGACGTAT GTAACTGGTT CTTAGGGAAT TTATATATAG 350 

TAGATAGTAT TGAAGATGTA AGGCAGAGCG ATATTGCGGT CATTATCTGC 400 

GTGCGCTGCG GCAAGATAGC CTGATAATAA GACTGATCGC ATAGAGGGGT 450 

25 GGTATTTCAC ACCGCCCATT GTCAACAGGC AGTTCAGCCT CGTTAAATTC 500 

AGCATGGGTA TCACTTATGA AAATTCATCT ACATTGGTGA TAATAGTAAA 550 

TCCAGTAGGG CGAAATAATT GACTGTAATT TACGGGGCAA AACGGCACAA 600 

TCTCAAACGA GATTGTGCCG TTTAAGGGGA AGATTCTAGA AATATTTCAT 650 

ACTTCCAACT ATATAGTTAA GGAGGAGACT GAAAATGAAG AAGTTGTTTT 700 

30 TTTTATTGTT ATTGTTATTC TTAATATACT TAGGTTATGA CTACGTTAAT 750 

GAAANCCTGA I 760 



35 2 ) INFORMATION FOR SEQ ID NO: 1070 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: R610 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1070 



AAAAGGAATA CGGGGCTTTC AAAAATCCAA GCCATAACCC GCGGGCAAGT 50 

GCCATTTGAT TCTTACGCTT TAATCGATTA GACACGGGTA AGCTTGTACC 100 

AATGGGGAAC CGATTTGATT TTAATGNATG AACGCTCTTC ATCATGCGGC 150 

AAATGGAATA TCATGCAATG AAGCGCAAAA TCGCAGACGT TTGCGCTCCA 200 

55 TCATGGAAAA CAGTGGGTTT GAAGCATATA GCCTCGAATG GTGGCACTAT 250 

GTATTAAGAG ACGAACCATA CCCCAATAGC TATTTTGATT TCCCCGTTAA 300 

ATAAACTTTT AACCGTTGCA CGGACAAACT ATATAAGCTA ACTCTTTCGG 350 

CAGGAAACCC GACGTATGTA ACTGGTTCTT AGGGAATTTA TATATAGTAG 400 

ATAGTATTGA AGATGTAAGG CAGAGCGATA TTGCGGTCAT TATCTGCGTG 450 

60 CGCTGCGGCA AGATAGCCTG ATAATAAGAC TGATCGCATA GAGGGGTGGT 500 

607 
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ATTTCACACC GCCCATTGTC AACAGGCAGT TCAGCCTCGT TAAATTCAGC 
ATGGGTATCA CTTATGAAAA TTCATCTACA TTGGTGATAA TAGTAAATCC 
AGTAGGGCGA AATAATTGAC TGTAATTTAC GGGGCAAAAC GGCACAATCT 
CAAACGAGAT TGTGCCGTTT AAGGGGAAGA TTCTAGAAAT ATTTCATACT 
TCCAACTATA TAGTTAAGGA GGAGACTGAA AATGAAGAAG TTGTTTTTTT 
TTATTGTTAT TGTTATTCTT AATATACTTA GGTTATGACT ACGTTAATGA 
A 
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2) INFORMATION FOR SEQ ID NO: 1071 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 711 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE : Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: R684 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1071 

TTGTACCAAT GGGGAGCCGA TTTGATTTTA TGGATGAACG CTCTCATCAT 50 

GCGGCAAATG GAATATCATG CAATGAAGCG CAAAATCGCA GACGTTTGCG 100 

CTCCATCATG GAAAACAGTG GGTTTGAAGC ATATAGCCTC GAATGGTGGC 150 

ACTATGTATT AAGAGACGAA CCATACCCCA ATAGCTATTT TGATTTCCCC 200 

GTTAAATAAA CTTTTAACCG TTGCACGGAC AAACTATATA AGCTAACTCT 250 

TTCGGCAGGA AACCCGACGT ATGTAACTGG TTCTTAGGGA ATTTATATAT 30 0 

AGTAGATAGT ATTGAAGATG TAAGGCAGAG CGATATTGCG GTCATTATCT 350 

GCGTGCGCTG CGGCAAGATA GCCTGATAAT AAGACTGATC GCATAGAGGG 4 00 

GTGGTATTTC ACACCGCCCA TTGTCAACAG GCAGTTCAGC CTCGTTAAAT 450 

TCAGCATGGG TATCACTTAT GAAAATTCAT CTACATTGGT GATAATAGTA 500 

AATCCAGTAG GGCGAAATAA TTGACTGTAA TTTACGGGGC AAAACGGCAC 550 

AATCTCAAAC GAGATTGTGC CGTTTAAGGG GAAGATTCTA GAAATATTTC 600 

ATACTTCCAA CTATATAGTT AAGGAGGAGA CTGAAAATGA AGAAGTTGTT 650 

TTTTTTATTG TTATTGTTAT TCTTAATATA CTTAGGTTAT GACTACGTTA 700 

ATGAAGCACT G 711 



2) INFORMATION FOR SEQ ID NO: 1072 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: R688 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1072 



608 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



GCCATTGATC TTACGCTTTA TCGATTAGAC ACGGGTNAGC TTGTACCAAT 50 

GGGGAGCCGA TTTGATTTTA TGGATGAACG CTCTCATCAT GCGGCAAATG 100 

GAATATCATG CAATGAAGCG CAAAATCGCA GACGTTTGCG CTCCATCATG 150 

GAAAACAGTG GGTTTGAAGC ATATAGCCTC GAATGGTGGC ACTATGTATT 200 

5 AAGAGACGAA CCATACCCCA ATAGCTATTT TGATTTCCCC GTTAAATAAA 250 

CTTTTAACCG TTGCACGGAC AAACTATATA AGCTAACTCT TTCGGCAGGA 300 

AACCCGACGT ATGTAACTGG TTCTTAGGGA ATTTATATAT AGTAGATAGT 350 

ATTGAAGATG TAAGGCAGAG CGATATTGCG GTCATTATCT GCGTGCGCTG 400 

CGGCAAGATA GCCTGATAAT AAGACTGATC GCATAGAGGG GTGGTATTTC 450 

10 ACACCGCCCA TTGTCAACAG GCAGTTCAGC CTCGTTAAAT TCAGCATGGG 500 

TATCACTTAT GAAAATTCAT CTACATTGGT GATAATAGTA AATCCAGTAG 550 

GGCGAAATAA TTGACTGTAA TTTACGGGGC AAAACGGCAC AATCTCAAAC 600 

GAGATTGTGC CGTTTAAGGG GAAGATTCTA GAAATATTTC ATACTTCCAA 650 

CTATATAGTT AAGGAGGAGA CTGAAAATGA AGAAGTTGTT TTTTTTATTG 700 

15 TTATTGTTAT TCTTAATATA CTTAGGTTAT GACTACGTTA ATGAAGCACT 750 

G 751 



20 2) INFORMATION FOR SEQ ID NO: 1073 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 685 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcns flavescens 

(B) STRAIN: R689 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1073 



ATTTGATTTT ATGGATGAAC GCTCTCATCA TGCGGCAAAT GGAATATCAT 50 

GCAATGAAGC GCAAAATCGC AGACGTTTGC GCTCCATCAT GGAAAACAGT 100 

GGGTTTGAAG CATATAGCCT CGAATGGTGG CACTATGTAT TAAGAGACGA 150 

ACCATACCCC AATAGCTATT TTGATTTCCC CGTTAAATAA ACTTTTAACC 200 

4 0 GTTGCACGGA CAAACTATAT AAGCTAACTC TTTCGGCAGG AAACCCGACG 250 

TATGTAACTG GTTCTTAGGG AATTTATATA TAGTAGATAG TATTGAAGAT 300 

GTAAGGCAGA GCGATATTGC GGTCATTATC TGCGTGCGCT GCGGCAAGAT 350 

AGCCTGATAA TAAGACTGAT CGCATAGAGG GGTGGTATTT CACACCGCCC 400 

ATTGTCAACA GGCAGTTCAG CCTCGTTAAA TTCAGCATGG GTATCACTTA 450 

4 5 TGAAAATTCA TCTACATTGG TGATAATAGT AAATCCAGTA GGGCGAAATA 500 

ATTGACTGTA ATTTACGGGG CAAAACGGCA . CAATCTCAAA CGAGATTGTG 550 

CCGTTTAAGG GGAAGATTCT AGAAATATTT CATACTTCCA ACTATATAGT 600 

TAAGGAGGAG ACTGAAAATG AAGAAGTTGT TTTTTTTATT GTTATTGTTA 650 

TTCTTAATAT ACTTAGGTTA TGACTACGTT AATGA 685 

50 



2) INFORMATION FOR SEQ ID NO: 1074 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



609 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
5 (B) STRAIN: R690 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1074 

ATCGATTAGA CACGGGTGAG CTTGTACCAA TGGGGAGCCG ATTTGATTTT 50 

10 ATGGATGAAC GCTCTCATCA TGCGGCAAAT GGAATATCAT GCAATGAAGC 100 

GCAAAATCGC AGACGTTTGC GCTCCATCAT GGAAAACAGT GGGTTTGAAG 150 

CATATAGCCT CGAATGGTGG CACTATGTAT TAAGAGACGA ACCATACCCC 200 

AATAGCTATT TTGATTTCCC CGTTAAATAA ACTTTTAACC GTTGCACGGA 250 

CAAACTATAT AAGCTAACTC TTTCGGCAGG AAACCCGACG TATGTAACTG 3 00 

15 GTTCTTAGGG AATTTATATA TAGTAGATAG TATTGAAGAT GTAAGGCAGA 350 

GCGATATTGC GGTCATTATC TGCGTGCGCT GCGGCAAGAT AGCCTGATAA 400 

TAAGACTGAT CGCATAGAGG GGTGGTATTT CACACCGCCC ATTGTCAACA 4 50 

GGCAGTTCAG CCTCGTTAAA TTCAGCATGG GTATCACTTA TGAAAATTCA 500 

TCTACATTGG TGATAATAGT AAATCCAGTA GGGCGAAATA ATTGACTGTA 550 

20 ATTTACGGGG CAAAACGGCA CAATCTCAAA CGAGATTGTG CCGTTTAAGG 600 

GGAAGATTCT AGAAATATTT CATACTTCCA ACTATATAGT TAAGGAGGAG 650 

ACTGAAAATG AAGAAGTTGT TTTTTTTATT GTTATTGTTA TTCTTAATAT 700 

ACTTAGGTTA TGACTACGTT AATGAAGCAC TG 732 



25 



40 



2) INFORMATION FOR SEQ ID NO: 1075 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 670 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

* 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinamm 

(B) STRAIN: R691 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1075 



TCTCATCATG CGGCAAATGG AATATCATGC AATGAAGCGC AAAATCGCAG 50 

ACGTTTGCGC TCCATCATGG AAAACAGTGG GTTTGAAGCA TATAGCCTCG 100 

45 AATGGTGGCA CTATGTATTA AGAGACGAAC CATACCCCAA TAGCTATTTT 150 

GATTTCCCCG TTAAATAAAC TTTTAACCGT TGCACGGACA AACTATATAA 200 

GCTAACTCTT TCGGCAGGAA ACCCGACGTA TGTAACTGGT TCTTAGGGAA 250 

TTTATATATA GTAGATAGTA TTGAAGATGT AAGGCAGAGC GATATTGCGG 300 

TCATTATCTG CGTGCGCTGC GGCAAGATAG CCTGATAATA AGACTGATCG 350 

50 CATAGAGGGG TGGTATTTCA CACCGCCCAT TGTCAACAGG CAGTTCAGCC 400 

TCGTTAAATT CAGCATGGGT ATCACTTATG AAAATTCATC TACATTGGTG 450 

ATAATAGTAA ATCCAGTAGG GCGAAATAAT TGACTGTAAT TTACGGGGCA 500 

AAACGGCACA ATCTCAAACG AGATTGTGCC GTTTAAGGGG AAGATTCTAG 550 

AAATATTTCA TACTTCCAAC TATATAGTTA AGGAGGAGAC TGAAAATGAA 600 

55 GAAGTTGTTT TTTTTATTGT TATTGTTATT CTTAATATAC TTAGGTTATG 650 

ACTACGTTAA TGAAGCACTG 670 



60 2) INFORMATION FOR SEQ ID NO: 1076 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 948 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM : Escherichia coli 

(B) STRAIN: DG131/3 serotype 0X3 :H8 

(C) ACCESSION NUMBER: Z36901 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1076 

ATGAAAATAA TAATTTTTAG AGTGCTAACT TTTTTCTTTG TTATCTTTTC 50 

TGTTAATGTG GTTGCGAAGG AATTTACCTT AGATTTCTCG ACAGCAAAGA 100 

CGTATGTAGA TTCGCTGAAT GTCATTCGCT CTGCAATAGG TACTCCATTA 150 

20 CAGACTATTT CATCAGGAGG TACGTCTTTA CTGATGATTG ATAGTGGCAC 200 

AGGGGATAAT TTGTTTGCAG TTGATGTCAG AGGGATAGAT CCAGAGGAAG 250 

GGCGGTTTAA TAATCTACGG CTTATTGTTG AACGAAATAA TTTATATGTG 300 

ACAGGATTTG TTAACAGGAC AAATAATGTT TTTTATCGCT TTGCTGATTT 350 

TTCACATGTT ACCTTTCCTG GTACAACTGC GGTTACATTG TCTGGTGACA 400 

25 GTAGCTATAC CACGTTACAG CGTGTTGCGG GGATCAGTCG TACGGGGATG 450 

CAGATAAATC GCCATTCGTT GACTACTTCT TATCTGGATT TAATGTCGCA 500 

TAGCGGAACC TCACTGACGC AGTCTGTGGC AAGAGCGATG TTACGGTTTG 550 

TTACTGTGAC AGCTGAAGCT TTACGTTTTC GGCAAATTCA GAGGGGATTT 600 

CGTACAACAC TTGATGATCT CAGTGGGCGT TCTTATGTAA TGACTGCTGA 650 

30 AGATGTTGAT CTTACGTTGA ACTGGGGAAG GTTGAGTAGT GTCCTGCCTG 700 

ACTATCATGG ACAAGACTCT GTTCGTGTTG GAAGAATTTC TTTTGGAAGT 750 

GTTAATGCAA TTCTGGGTAG CGTGGCATTA ATACTGAATT GTCATCATCA 800 

TGCATCGCGA GTTGCCAGAA TTGTACCTAA TGAGTTTCCT TCTATGTGCC 850 

CGGTAGATGG AAGAGTGCGT GGGATTACGC ACAATAAAAT ATTGTGGGAC 900 

35 TCATCCACTC TGGGGGCAAT TTTGATACGC AGGGCTATTA GCAGTTGA 948 



2) INFORMATION FOR SEQ ID NO: 1077 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 1259 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Escherichia coli 

(B) STRAIN: 94C serotype 048:H21 

(C) ACCESSION NUMBER: extracted from Z37725 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1077 

55 

CACCTGTATA TGAAGTGTAT ATTATTTAAA TGGGTACTGT GCCTGTTACT 50 

GGGTTTTTCT TCGGTATCCT ATTCCCGGGA GTTTACGATA GACTTTTCGA 100 

CCCAACAAAG TTATGTCTCT TCGTTAAATA GTATACGGAC AGAGATATCG 150 

ACCCCTCTTG AACAT AT AT C TCAGGGGACC ACATCGGTGT CTGTTATTAA 200 

60 CCACACCCCA CCGGGCAGTT ATTTTGCTGT GGATATACGA GGGCTTGATG 250 

611 
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TCTATCAGGC GCGTTTTGAC CATCTTCGGC 
TTATATGTGG CCGGGTTCGT TAATACGGCA 
TTCAGATTTT ACACATATAT CAGTGCCCGA 
CAACGGACAG CAGTTATACC ACTCTGCAAC 
TCCGGAATGC AAATCAGTCG TCACTCACTG 
AATGGAGTTC AGTGGTAATA CAATGACCAG 
TGCGTTTTGT CACTGTCACA GCAGAAGCCT 
AGAGAATTTC GTCAGGCACT GTCTGAAACT 
GCCGGGAGAC GTGGACCTCA CTCTGAACTG 
TTCCGGAGTA TCGGGGAGAG GATGGTGTCA 
AATAATATAT CAGCGATACT GGGTACTGTG 
TCATCAGGGG GCGCGTTCTG TTCGCGCCGT 
AATGTCAGAT AACTGGCGAC AGGCCCGTTA 
TGGGAAAGTA ATACAGCTGC AGCGTTTCTG 
ATATACAACG GGTAAATAAA GGAGTTAAGT 
GGTTTTATTT GCATTAGTTT CTGTTAATGC 
AAGGTAAAAT TGAGTTTTCC AAGTATAATG 
AAGGTTGACG GGAAAGAATA CTGGACCAGT 
ACTGCAAAGT GCTCAGCTGA CAGGAATGAC 
CCTGTGAATC AGGCTCCGGA TTTGCTGAAG 
GGCATAACC 



PCT/CA00/01150 

TGATTATTGA GCAAAATAAT 300 

ACAAATACTT TCTACCGTTT 350 

TGTGACAACG GTTTCCATGA 400 

GTGTCGCAGC GCTGGAACGT 450 

GTTTCATCAT ATCTGGCGTT 500 

AGATGCATCC AGAGCAGTTC 550 

TACGCTTCAG GCAGATACAG 600 

GCTCCTGTGT ATACCATGAC 650 

GGGGCGAATC AGCAATGTGC 700 

GAGTGGGGAG AATATCCTTT 750 

GCCGTTATAC TGAATTGCCA 800 

GAATGAAGAG AGTCAACCAG 850 

TAAAAATAAA CAATACATTA 900 

AACAGAAAGT CACAGTTTTT 950 

ATGAAGAAGA TGTTTATGGC 1000 

AATGGCGGCG GATTGTGCTA 1050 

AGGATGACAC ATTTACAGTG 1100 

CGCTGGAATC TGCAACCGTT 1150 

TGTCACAATC AAATCCAGTA 1200 

TGCAGTTTAA TAATGACTGA 1250 

1259 



2) INFORMATION FOR SEQ ID NO: 1078 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear r 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1078 
AGTTCTGCGT TTTGTCACTG TC 



2) INFORMATION FOR SEQ ID NO: 1079 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1079 
CGGAAGCACA TTGCTGATT 



2) INFORMATION FOR SEQ ID NO: 1080 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1080 
TATAGCTACT GTCACCAGAC AATGT 



2) INFORMATION FOR SEQ ID NO: 1081 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1081 
ATGTCAGAGG GATAGATCCA 



2) INFORMATION FOR SEQ ID NO: 1082 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1082 
TTGARCRAAA TAATTTATAT GTG 



2) INFORMATION FOR SEQ ID NO: 1083 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1083 
TGATGATGRC AATTCAGTAT 



613 



SUBSTITUTE SHEET (RULE 26) 



LO 



15 



WO 01/23604 PCT/CA00/01150 
2) INFORMATION FOR SEQ ID NO: 1084 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1084 
CCACGCCGCT TTGCTGATTT TTCACATGTT ACCGCGTGG 39 



2) INFORMATION FOR SEQ ID NO: 1085 



(i) SEQUENCE CHARACTERISTICS: 
2 0 (A) LENGTH: 34 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1085 
CCACGCCACT GTCTGAAACT GCTCCTGTGC GTGG 34 

30 

2) INFORMATION FOR SEQ ID NO: 1086 . 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

4 0. 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1086 
45 CTACTCCCGC CTTTTGGGTT 20 



2) INFORMATION FOR SEQ ID NO: 1087 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1087 

60 
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60 
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CTCACAGCCC GAAACAGCCT 20 



5 2) INFORMATION FOR SEQ ID NO: 1088 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1088 

TGCCGTTTCC TGTATCCGTC 20 



2) INFORMATION FOR SEQ ID NO: 1089 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2 0 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi)' SEQUENCE DESCRIPTION: SEQ ID NO: 1089 
ATCCACACGG GCTAGACCTC 20 



2) INFORMATION FOR SEQ ID NO: 1090 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1090 
AATAGCGCGG ACGAATTGGA C 21 

50 



2) INFORMATION FOR SEQ ID NO: 1091 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1091 
AACGCGGCAC TGTTTCCCAA 



2) INFORMATION FOR SEQ ID NO: 1092 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1092 
TCGGCAAGAC AATATGACAG C 



2) INFORMATION FOR SEQ ID NO: 1093 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 



(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: CSsa-165 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1093 



2) INFORMATION FOR SEQ ID NO: 1094 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 



TAACGGGCGT CTCGATAGAA 
AATCAATATA CGATTTTGCT 
ACAAATAAAA ATTAATAGGC 
CGATTCTGGT TTTTTAAATT 
TTTTAAAGTA TAGGTATAAA 
AATCATGAAA CAATTAACTA 
TTACAACAAC ACTGATTGGC 
AAACATCCTA TTGATTTTGT 



AAACACGTGA AAATCCCAAT GATTATAAAC 
GAAGCTGTAA CAAAAGGTAT TAAGGAACAA 
AACTTAACCA GAATCGTTAA AACTATATGA 
CAAAAAGTTT TCTAAAAAAT TTACCTGCTT 
ATACAATTGA TTAAAATAGT AAAGGAAATG 
AGCCTTTATA CTTTTACCTA TTACTTTTTA 
GCGTTACTAT TATATTTGCC AATCACAGGT 
GGACGCCCGT T 



50 
100 
150 
200 
250 
300 
350 
381 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1098 
TCTTCAAAAT CGAAAAAGCC GTC 



2) INFORMATION FOR SEQ ID NO: 1099 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1099 
TCAAAAGGGA T C AC WAAAGT MAC 



2) INFORMATION FOR SEQ ID NO: 1100 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1100 
GTAAAKCCCG GCATRGTRTT GATTTC 



2) INFORMATION FOR SEQ ID NO: 1101 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1101 
GACGGYTTTT TYGATTTTGA AGA 



2) INFORMATION FOR SEQ ID NO: 1102 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1102 
AAAAARTCGA TKCGAGCMAG ACC 



2) INFORMATION FOR SEQ ID NO: 1103 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1103 
ATCCCGCTAT GAAAACGATC 



2) INFORMATION FOR SEQ ID NO: 1104 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1104 
GGATCAACAC AGTAGAACCG 



2 ) INFORMATION FOR SEQ ID NO: 1105 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1105 
CTCCTACGAT TCTCTTGAYA AATCA 

619 
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2) INFORMATION FOR SEQ ID NO: 1106 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1106 

15 

CAACCGATCT CAACACCGGC AAT 23 



20 2) INFORMATION FOR SEQ ID NO: 1107 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1107 

CTCATTTGAC TTCCTCCTTT GCT 23 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1108 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1108 
GTAAGAATCG GAAAAGCGGA AGG 23 



2 ) INFORMATION FOR SEQ ID NO: 1109 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: DNA 

620 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1109 
ACATCGTGAT CGCTAAAAGG AGC 
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2) INFORMATION FOR SEQ ID NO: 1110 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1110 
ACGAGAAAGA CAACAGGAAG ACC 



2) INFORMATION FOR SEQ ID NO: 1111 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1111 
CTTTTTCCGG CTCGWYTTCC TGATG 



2) INFORMATION FOR SEQ ID NO: 1112 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1112 
GGCTGYGATA TTCAAAGCTC 



2) INFORMATION FOR SEQ ID NO: 1113 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1113 
ACCGACCTCA CAGCCCGAAA 20 



2) INFORMATION FOR SEQ ID NO: 1114 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1114 
TCWGAGCCTT TTTCCGGCTC G 21 

25 

2) INFORMATION FOR SEQ ID NO: 1115 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

' (C) STRANDEDNESS: Single 
(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1115 
40 TTTCGGGCTG TGAGGTCGGB TGHGCG 26 



2 ) INFORMATION FOR SEQ ID NO: 1116 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1116 

55 

TTTCGGGCTG TGAGGTCGGB TGHGCGG 27 



60 2) INFORMATION FOR SEQ ID NO: 1117 

622 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01150 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

L0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: 94 

(C) ACCESSION NUMBER: U94526 

L5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1117 

AAATTCGATC CGCACTACAT CGGAATTACA AAAAACGGTG TATGGAAGCT 50 

ATGCAAGAAG CCATGTACGG AATGGGAAGC CGACAGTCTC CCCGCCATAC 100 

TCTCCCCGGA TAGGAAAACG CATGGGCTGC TTGTCATGAA AGAAAGCGAA 150 

20 TACGAAACAC GGCGTATTGA TGTGGCTTTC CCGGTTTTGC ATGGCAAATG 200 

CGGGGAGGAT GGTGCGATAC AGGGGCTGTT TGTATTGTCT GGTATCCCCT 250 

ATGTGGGCTG TGATATTCAA AGCTCCGCAG CTTGCATGGA CAAATCACTG 3 00 

GCCTACATTC TTACAAAAAA TGCGGGCATC GCCGTTCCCG AATTTCAAAT 350 

GATTGATAAA GGTGACAAGC CGGAGGCGGG TGCGCTTACC TACCCTGTCT 4 00 

25 TTGTGAAGCC GGCACGGTCA GGTTCGTCCT TTGGCGTAAC CAAAGTAAAC 450 

GGTACGGAAG AACTTAACGC TGCGATAGAA GCGGCAGGAC AATATGATGG 500 

AAAAATCTTA ATTGAGCAAG CGATTTCGGG CTGTGAGGTC GGGTGTGCGG 550 

TCATGGGGAA CGAGGATGAT TTGATTGTCG GCGAAGTGGA TCAAATCCGG 600 

CTGAGCCACG GTATCTTCCG CATCCATCAG GAAAACGAGC CGGAAAAAGG 650 

10 CTCAGAAAAT GCGATGATTA CAGTTCCCGC AGACATTCCG GTCGAGGAAC 700 

GAAATCGGGT GCAGGAAACG GCAAAGAAAG TATATCGGGT GCTTGGATGC 750 

AGAGGGCTTG CCCGTGTTGA TCTTTTTTTG CAGGAGGATG GCGGCATCGT 800 

T 1 801 



35 



2 ) INFORMATION FOR SEQ ID NO: 1118 



(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1118 
TTTTCWGAGC CTTTTTCCGG CTCG 24 



50 



2) INFORMATION FOR SEQ ID NO: 1119 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1119 
TTTCGGGCTG TGAGGTCGGB TGHGC 



2) INFORMATION FOR SEQ ID NO: 1120 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1120 
TTTCGGGCTG TGAGGTCGGB TGHG 



2) INFORMATION FOR SEQ ID NO: 1121 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1121 
TGTTTGWATT GTCYGGYATC CC 



2) INFORMATION FOR SEQ ID NO: 1122 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single ■ 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1122 
TGGTGCATTG CTACGTGG 



2) INFORMATION FOR SEQ ID NO: 1123 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
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2) INFORMATION FOR SEQ ID NO: 1127 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1127 
KCAAAYGCCA TTTCAAGTAA 20 



2) INFORMATION FOR SEQ ID NO: 1128 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 112 8 
GACGACYTTA TKGATATACA 20 

30 

2) INFORMATION FOR SEQ ID NO: 1129 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1129 
45 ATGATGACHG AMATGATGAA AAC 23 



2) INFORMATION FOR SEQ ID NO: 1130 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1130 

60 
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22 



5 2 ) INFORMATION FOR SEQ ID NO: 1131 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1131 

CATCTGGAGC TACRTARCCA GT 22 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1132 



(i) SEQUENCE CHARACTERISTICS: 
■ (A) LENGTH: 20 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1132 
AGTGAAAARA TGGCTGCTGC ' ,20 



2 ) INFORMATION FOR SEQ ID NO: 1133 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1133 
CATCAAGAAC ACTGGCTAYG TAG 23 

50 



2) INFORMATION FOR SEQ ID NO: 1134 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

60 

627 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1134 
CTAGATAGAG CTAAAACCTT CCT 



2 ) INFORMATION FOR SEQ ID NO: 1135 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1135 
CATTATGCAA ACGCCATTTC AAG 



2) INFORMATION FOR SEQ ID NO: 1136 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1136 
ACTTGTCCAC GTTSGATRTC T 



2) INFORMATION FOR SEQ ID NO: 1137 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1137 
AATTAATGGC TGCWGTTGAY GAA 



2) INFORMATION FOR SEQ ID NO: 1138 

(i) SEQUENCE CHARACTERISTICS: 
' (A) LENGTH: 1032 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 
(C) ACCESSION NUMBER: M75132 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1138 



ATGAAAAAAA TTGCCGTTTT ATTTGGAGGG AATTCTCCAG AATACTCAGT 50 

GTCACTAACC TCAGCAGCAA GTGTGATCCA AGCTATTGAC CCGCTGAAAT 100 

15 ATGAAGTAAT GACCATTGGC ATCGCACCAA CAATGGATTG GTATTGGTAT 150 

CAAGGAAACC TCGCGAATGT TCGCAATGAT ACTTGGCTAG AAGATCACAA 200 

AAACTGTCAC CAGCTGACTT TTTCTAGCCA AGGATTTATA TTAGGAGAAA 250 

AACGAATCGT CCCTGATGTC CTCTTTCCAG TCTTGCATGG GAAGTATGGC 300 

GAGGATGGCT GTATCCAAGG ACTGCTTGAA CTAATGAACC TGCCTTATGT 350 

2 0 TGGTTGCCAT GTCGCTGCCT CCGCATTATG TATGAACAAA TGGCTCTTGC 400 

ATCAACTTGC TGATACCATG GGAATCGCTA GTGCTCCCAC TTTGCTTTTA 450 

TCCCGCTATG AAAACGATCC TGCCACAATC GATCGTTTTA TTCAAGACCA 500 

TGGATTCCCG ATCTTTATCA AGCCGAATGA AGCCGGTTCT TCAAAAGGGA 550 

TCACAAAAGT AACTGACAAA ACAGCGCTCC AATCTGCATT AACGACTGCT 60 0 

25 TTTGCTTACG GTTCTACTGT GTTGATCCAA AAGGCGATAG CGGGTATTGA 650 

AATTGGCTGC GGCATCTTAG GAAATGAGCA ATTGACGATT GGTGCTTGTG 700 

ATGCGATTTC TCTTGTCGAC GGTTTTTTTG ATTTTGAAGA GAAATACCAA 750 

TTAATCAGCG CCACGATCAC TGTCCCAGCA CCATTGCCTC TCGCGCTTGA 800 

ATCACAGATC AAGGAGCAGG CACAGCTGCT TTATCGAAAC TTGGGATTGA 850 

3 0 CGGGTCTGGC TCGAATCGAT TTTTTCGTCA CCAATCAAGG AGCGATTTAT 900 

TTAAACGAAA TCAACACCAT GCCGGGATTT ACTGGGCACT CCCGCTACCC 950 

AGCTATGATG GCGGAAGTCG GGTTATCCTA CGAAATATTA GTAGAGCAAT 1000 

TGATTGCACT GGCAGAGGAG GACAAACGAT GA 1 1032 

35 

2) INFORMATION FOR SEQ ID NO: 1139 

(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 1768 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: BM4147 

50 (C) ACCESSION NUMBER: X56895 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1139 

GATATCGTTA CGCTTCATGT GCCGCTCAAT ACGGATACGC ACTATATTAT 50 

55 CAGCCACGAA CAAATACAGA GAATGAAGCA AGGAGCATTT CTTATCAATA 100 

CTGGGCGCGG TCCACTTGTA GATACCTATG AGTTGGTTAA AGCATTAGAA 150 

AACGGGAAAC TGGGCGGTGC CGCATTGGAT GTATTGGAAG GAGAGGAAGA 200 

GTTTTTCTAC TCTGATTGCA CCCAAAAACC AATTGATAAT CAATTTTTAC 250 

TTAAACTTCA AAGAATGCCT AACGTGATAA TCACACCGCA TACGGCCTAT 300 

60 TATACCGAGC AAGCGTTGCG TGATACCGTT GAAAAAACCA TTAAAAACTG 350 
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TTTGGATTTT GAAAGGAGAC AGGAGCATGA ATAGAATAAA AGTTGCAATA 400 

CTGTTTGGGG GTTGCTCAGA GGAGCATGAC GTATCGGTAA AATCTGCAAT 450 

AGAGATAGCC GCTAACATTA ATAAAGAAAA ATACGAGCCG TTATACATTG 500 

GAATTACGAA ATCTGGTGTA TGGAAAATGT GCGAAAAACC TTGCGCGGAA 550 

5 TGGGAAAACG ACAATTGCTA TTCAGCTGTA CTCTCGCCGG ATAAAAAAAT 600 

GCACGGATTA CTTGTTAAAA AGAACCATGA ATATGAAATC AACCATGTTG 650 

ATGTAGCATT TTCAGCTTTG CATGGCAAGT CAGGTGAAGA TGGATCCATA 700 

CAAGGTCTGT TTGAATTGTC CGGTATCCCT TTTGTAGGCT GCGATATTCA 750 

AAGCTCAGCA ATTTGTATGG ACAAATCGTT GACATACATC GTTGCGAAAA 800 

LO ATGCTGGGAT AGCTACTCCC GCCTTTTGGG TTATTAATAA AGATGATAGG 850 

CCGGTGGCAG CTACGTTTAC CTATCCTGTT TTTGTTAAGC CGGCGCGTTC 900 

AGGCTCATCC TTCGGTGTGA AAAAAGTCAA TAGCGCGGAC GAATTGGACT 950 

ACGCAATTGA ATCGGCAAGA CAATATGACA GCAAAATCTT AATTGAGCAG 1000 

GCTGTTTCGG GCTGTGAGGT CGGTTGTGCG GTATTGGGAA ACAGTGCCGC 1050 

15 GTTAGTTGTT GGCGAGGTGG ACCAAATCAG GCTGCAGTAC GGAATCTTTC 1100 

GTATTCATCA GGAAGTCGAG CCGGAAAAAG GCTCTGAAAA CGCAGTTATA 1150 

ACCGTTCCCG CAGACCTTTC AGCAGAGGAG CGAGGACGGA TACAGGAAAC 1200 

GGCAAAAAAA ATATATAAAG CGCTCGGCTG TAGAGGTCTA GCCCGTGTGG 1250 

ATATGTTTTT ACAAGATAAC GGCCGCATTG TACTGAACGA AGTCAATACT 1300 

20 CTGCCCGGTT TCACGTCATA CAGTCGTTAT CCCCGTATGA TGGCCGCTGC 1350 

AGGTATTGCA CTTCCCGAAC TGATTGACCG CTTGATCGTA TTAGCGTTAA 1400 

AGGGGTGATA AGCATGGAAA TAGGATTTAC TTTTTTAGAT GAAATAGTAC 1450 

ACGGTGTTCG TTGGGACGCT AAATATGCCA CTTGGGATAA TTTCACCGGA 1500 

AAACCGGTTG ACGGTTATGA AGTAAATCGC ATTGTAGGGA CATACGAGTT 1550 

25 GGCTGAATCG CTTTTGAAGG CAAAAGAACT GGCTGCTACC CAAGGGTACG 1600 

GATTGCTTCT ATGGGACGGT TACCGTCCTA AGCGTGCTGT AAACTGTTTT 1650 

ATGCAATGGG CTGCACAGCC GGAAAATAAC CTGACAAAGG AAAGTTATTA 1700 

TCCCAATATT GACCGAACTG AGATGATTTC AAAAGGATAC GTGGCTTCAA 1750 

AATCAAGCCA TAGCCGCG 1768 

30 



2) INFORMATION FOR SEQ ID NO: 1140 



35 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1086 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 
45 (C) ACCESSION NUMBER: L2 9638 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1140 



GTAAGAATCG GAAAAGCGGA 
50 ATTTTTGGAG GCAATTCACC 
TAGCGCAATC GAAGCACTCC 
TGATCGGGAT CGCCCCAGAT 
CTGGAAAACA TCCGACAAGA 
GAAAATACAG CCGCTATTCG 
55 AGCAAACGTT GGTACCTGAT 
GGGGAAGATG GCAGTATCCA 
TGTAGGCTGC GGGGTGGCAG 
TGCATCAAGC TGCAGCAGCC 
TTGACAAATC AAGCCAACCA 
60 CCATGGCTTC CCAGTTTTCT 



AGGAAGAAAA ACATGAAAAA 
GGAATACACC GTTTCTTTAG 
AATCATCTCC CTATGACTAC 
GCTATGGATT GGTACTTGTA 
CACGTGGTTG TTGGATACGA 
AAGGAAACGG CTTTTGGCTA 
GTTTTATTTC CCATTATGCA 
AGGATTGTTT GAATTGATGA 
GTTCTGCCTT ATGTATGAAC 
ATTGGCGTAC AAAGTGCTCC 
GCAAGAACAA ATCGAAGCTT 
TTAAG CCTAA TGAAGCGGGC 

630 



AATCGCCATT 


50 


CTTCAGCAAC 


100 


GACCTCTCTT 


150 


TACAGGAGAA 


200 


AACATAAACA 


250 


AGTGAAGAGC 


300 


TGGCAAATAC 


350 


AGCTGCCTTA 


400 


AAATGGCTGC 


450 


TACGATTCTC 


500 


TTATCCAGAC 


550 


TCCTCAAAAG 


600 
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GGATCACTAA AGTCACCTGC GTTGAAGAAA 
GCCTTTACTT ATTGTTCCGC AGTGCTCCTA 
TGAGATCGGT TGCGGTATTT TGGGCAACGA 
GTGACGCCAT TTCATTAGTA GACGGCTTTT 
5 CAGCTGATCA GCGCCAAAAT CACCGTCCCT 
TGAAACCAAG GTCAAAGAAC AAGCTCAGCT 
TTAAAGGTCT TGCTCGCATC GACTTTTTTG 
TACTTGAATG AAATCAATAC TATGCCGGGC 
TCCTGCCATG ATGGCAGCGG TCGGCTTATC 
LO AACTGCTTGT CTTAGCAAAG GAGGAAGTCA 
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TCGCTTCTGC CTTAAAAGAA 650 

CAAAAAAATA TTGCCGGTGT 700 

CTCTTTGACT GTCGGTGCTT 750 

TCGATTTTGA AGAAAAGTAC 800 

GCGCCATTGC CTGAAACGAT 850 

GCTCTATCGT AGTCTTGGTC 900 

TCACGGAGCG AGGAGAACTA 950 

TTTACGAGTC ACTCCCGCTA 1000 

CTATCAAGAA CTACTACAAA 1050 

AATGAG 1086 



2) INFORMATION FOR SEQ ID NO: 1141 

L5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 946 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
25 (B) STRAIN: BM4147 

(C) ACCESSION NUMBER: extracted from M972 97 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1141 

30 ATGAATAACA TCGGCATTAC TGTTTATGGA TGTGAGCAGG ATGAGGCAGA 50 

TGCATTCCAT GCTCTTTCGC CTCGCTTTGG CGTTATGGCA ACGATAATTA 100 

ACGCCAACGT GTCGGAATCC AACGCCAAAT CCGCGCCTTT CAATCAATGT 150 

ATCAGTGTGG GACATAAATC AGAGATTTCC GCCTCTATTC TTCTTGCGCT 200 

GAAGAGAGCC GGTGTGAAAT ATATTTCTAC CCGAAGCATC GGCTGCAATC 250 

35 ATATAGATAC AACTGCTGCT AAGAGAATGG GCATCACTGT CGACAATGTG 300 

GCGTACTCGC CGGATAGCGT TGCCGATTAT ACTATGATGC TAATTCTTAT 350 

GGCAGTACGC AACGTAAAAT CGATTGTGCG CTCTGTGGAA AAACATGATT 400 

TCAGGTTGGA CAGCGACCGT GGCAAGGTAC TCAGCGACAT GACAGTTGGT 450 

GTGGTGGGAA CGGGCCAGAT AGGCAAAGCG GTTATTGAGC GGCTGCGAGG 500 

40 ATTTGGATGT AAAGTGTTGG CTTATAGTCG CAGCCGAAGT ATAGAGGTAA 550 

ACTATGTACC GTTTGATGAG TTGCTGCAAA ATAGCGATAT CGTTACGCTT 600 

CATGTGCCGC TCAATACGGA TACGCACTAT ATTATCAGCC ACGAACAAAT 650 

ACAGAGAATG AAGCAAGGAG CATTTCTTAT CAATACTGGG CGCGGTCCAC 700 

TTGTAGATAC CTATGAGTTG GTTAAAGCAT TAGAAAACGG GAAACTGGGC 750 

45 GGTGCCGCAT TGGATGTATT GGAAGGAGAG GAAGAGTTTT TCTACTCTGA 800 

TTGCACCCAA AAACCAATTG ATAATCAATT TTTACTTAAA CTTCAAAGAA 850 

TGCCTAACGT GATAATCACA CCGCATACGG CCTATTATAC CGAGCAAGCG 900 

TTGCGTGATA CCGTTGAAAA AACCATTAAA AACTGTTTGG ATTTTGAAAG 950 

GAGACAGGAG CATGAATAGA ATAAAAGTTG CAATACTGTT TGGGGGTTGC 1000 

50 TCAGAGGAGC ATGACGTATC GGTAAAATCT GCAATAGAGA TAGCCGCTAA 1050 

CATTAATAAA GAAAAATACG AGCCGTTATA CATTGGAATT ACGAAATCTG 1100 

GTGTATGGAA AATGTGCGAA AAACCTTGCG CGGAATGGGA AAACGACAAT 1150 

TGCTATTCAG CTGTACTCTC GCCGGATAAA AAAATGCACG GATTACTTGT 1200 

TAAAAAGAAC CATGAATATG AAATCAACCA TGTTGATGTA GCATTTTCAG 1250 

55 CTTTGCATGG CAAGTCAGGT GAAGATGGAT CCATACAAGG TCTGTTTGAA 13 00 

TTGTCCGGTA TCCCTTTTGT AGGCTGCGAT ATTCAAAGCT CAGCAATTTG 1350 

TATGGACAAA TCGTTGACAT ACATCGTTGC GAAAAATGCT GGGATAGCTA 1400 

CTCCCGCCTT TTGGGTTATT AATAAAGATG ATAGGCCGGT GGCAGCTACG 1450 

TTTACCTATC CTGTTTTTGT TAAGCCGGCG CGTTCAGGCT CATCCTTCGG 1500 

60 TGTGAAAAAA GTCAATAGCG CGGACGAATT GGACTACGCA ATTGAATCGG 1550 
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CAAGACAATA TGACAGCAAA ATCTTAATTG AGCAGGCTGT TTCGGGCTGT 1600 

GAGGTCGGTT GTGCGGTATT GGGAAACAGT GCCGCGTTAG TTGTTGGCGA 1650 

GGTGGACCAA ATCAGGCTGC AGTACGGAAT CTTTCGTATT CATCAGGAAG 1700 

TCGAGCCGGA AAAAGGCTCT GAAAACGCAG TTATAACCGT TCCCGCAGAC 1750 

5 CTTTCAGCAG AGGAGCGAGG ACGGATACAG GAAACGGCAA AAAAAATATA 1800 

TAAAGCGCTC GGCTGTAGAG GTCTAGCCCG TGTGGATATG TTTTTACAAG 1850 

ATAACGGCCG CATTGTACTG AACGAAGTCA ATACTCTGCC CGGTTTCACG 1900 

TCATACAGTC GTTATCCCCG TATGATGGCC GCTGCAGGTA TTGCACTTCC 1950 

CGAACTGATT GACCGCTTGA TCGTATTAGC GTTAAAGGGG TGATAAGCAT 2 000 

10 GGAAATAGGA TTTACTTTTT TAGATGAAAT AGTACACGGT GTTCGTTGGG 2050 

ACGCTAAATA TGCCACTTGG GATAATTTCA CCGGAAAACC GGTTGACGGT 2100 

TATGAAGTAA ATCGCATTGT AGGGACATAC GAGTTGGCTG AATCGCTTTT 2150 

GAAGGCAAAA GAACTGGCTG CTACCCAAGG GTACGGATTG CTTCTATGGG 2200 

ACGGTTACCG TCCTAAGCGT GCTGTAAACT GTTTTATGCA ATGGGCTGCA 2250 

15 CAGCCGGAAA ATAACCTGAC AAAGGAAAGT TATTATCCCA ATATTGACCG 2300 

AACTGAGATG ATTTCAAAAG GATACGTGGC TTCAAAATCA AGCCATAGCC 2350 

GCGGCAGTGC CATTGATCTT ACGCTTTATC GATTAGACAC GGGTGAGCTT 24 00 

GTACCAATGG GGAGCCGATT TGATTTTATG GATGAACGCT CTCATCATGC 2450 

GGCAAATGGA ATATCATGCA ATGAAGCGCA AAATCGCAGA CGTTTGCGCT 2500 

20 CCATCATGGA AAACAGTGGG TTTGAAGCAT ATAGC CTCGA ATGGTGGCAC 2 550 

TATGTATTAA GAGACGAACC ATACCCCAAT AGCTATTTTG ATTTCCCCGT 2600 

TAAATAAACT TTTAACCGTT GCACGGACAA ACTATATAAG CTAACTCTTT 2650 

CGGCAGGAAA CCCGACGTAT GTAACTGGTT CTTAGGGAAT TTATATATAG 2700 

TAGATAGTAT TGAAGATGTA AGGCAGAGCG ATATTGCGGT CATTATCTGC 2750 

25 GTGCGCTGCG GCAAGATAGC CTGATAATAA GACTGATCGC ATAGAGGGGT 2 800 

GGTATTTCAC ACCGCCCATT GTCAACAGGC AGTTCAGCCT CGTTAAATTC 2850 

AGCATGGGTA TCACTTATGA AAATTCATCT ACATTGGTGA TAATAGTAAA 2900 

TCCAGTAGGG CGAAATAATT GACTGTAATT TACGGGGCAA AACGGCACAA 2 950 

TCTCAAACGA GATTGTGCCG TTTAAGGGGA AGATTCTAGA AATATTTCAT 3 000 

30 ACTTCCAACT ATATAGTTAA GGAGGAGACT GAAAATGAAG AAGTTGTTTT 3050 

TTTTATTGTT ATTGTTATTC TTAATATACT TAGGTTATGA CTACGTTAAT 3100 

GAAGCACTGT TTTCTCAGGA AAAAGTCGAA TTTCAAAATT ATGATCAAAA . 3150 

TCCCAAAGAA CATTTAGAAA ATAGTGGGAC TTCTGAAAAT ACCCAAGAGA 3200 

AAACAATTAC AGAAGAACAG GTTTATCAAG GAAATCTGCT ATTAATCAAT 3250 

35 AGTAAATATC CTGTTCGCCA AGAAAGTGTG AAGTCAGATA TCGTGAATTT 33 00 

ATCTAAACAT GACGAATTAA TAAATGGATA CGGGTTGCTT GATAGTAATA 3350 

TTTATATGTC AAAAGAAATA GCACAAAAAT TTTCAGAGAT GGTCAATGAT 3400 

GCTGTAAAGG GTGGCGTTAG TCATTTTATT ATTAATAGTG GCTATCGAGA 3450 

CTTTGATGAG CAAAGTGTGC TTTACCAAGA AATGGGGGCT GAGTATGCCT 3500 

40 TACCAGCAGG TTATAGTGAG CATAATTCAG GTTTATCACT AGATGTAGGA 3550 

TCAAGCTTGA CGAAAATGGA ACGAGCCCCT GAAGGAAAGT GGATAGAAGA 3600 

AAATGCTTGG AAATACGGGT TCATTTTACG TTATCCAGAG GACAAAACAG 3650 

AGTTAACAGG AATTCAATAT GAACCATGGC ATATTCGCTA TGTTGGTTTA 3700 

CCACATAGTG CGATTATGAA AGAAAAGAAT TTCGTTCTCG AGGAATATAT 3750 

45 GGATTACCTA AAAGAAGAAA AAACCATTTC TGTTAGTGTA AATGGGGAAA 3800 

AATATGAGAT CTTTTATTAT CCTGTTACTA AAAATACCAC CATTCATGTG 3850 

CCGACTAATC TTCGTTATGA GATATCAGGA AACAATATAG ACGGTGTAAT 3 900 

TGTGACAGTG TTTCCCGGAT CAACACATAC TAATTCAAGG AGGTAA 3946 



50 



2) INFORMATION FOR SEQ ID NO: 1142 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

60 (ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1142 
GATCCTCTAA ATGATTCTCA GGTGG 



PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 1143 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1143 
CAATTAGCTT AGCAATAGGT GTTGG 



2) INFORMATION FOR SEQ ID NO: 1144 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: t 1144 
TGTYTTCCAA GGTTCAGCTC 



2) INFORMATION FOR SEQ ID NO: 1145 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1145 
AACATATTKG GTTGATAGGT 



2) INFORMATION FOR SEQ ID NO: 1146 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1146 
GGGATTACCT ATGCCAATAT GAT 23 



2) INFORMATION FOR SEQ ID NO: 1147 



(i) SEQUENCE CHARACTERISTICS : 
15. (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1147 
AGCTGTGTTA GCVCGAACAT CTTG 24 

25 

2) INFORMATION FOR SEQ ID NO: 1148 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1148 
40 GACTTTGTTT GGCGTGATAT 2 0 



2) INFORMATION FOR SEQ ID NO: 1149 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1149 

55 

TCCYACWATT TCTTTTTGWG 20 



60 2) INFORMATION FOR SEQ ID NO: 1150 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1150 
TGATAATCAC ACCGCATACG 



2) INFORMATION FOR SEQ ID NO: 1151 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1151 
TGCTGTCATA TTGTCTTGCC 



2) INFORMATION FOR SEQ ID NO: 1152 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1152 
ATAAAGATGA TAGGCCGGTG 



2) INFORMATION FOR SEQ ID NO: 1153 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1153 
CTCGTATGTC CCTACAATGC 
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2) INFORMATION FOR SEQ ID NO: 1154 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1154 
GTTTGAAGCA TATAGCCTCG 



20 



20 2) INFORMATION FOR SEQ ID NO: 1155 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1155 



CAGTGCTTCA TTAACGTAGT C 



21 



35 



2) INFORMATION FOR SEQ ID NO: 1156 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



\ 



45 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1156 



GTTGAAATGC ATCACGAACA ATT 



23 



50 



2) INFORMATION FOR SEQ ID NO: 1157 



55 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



60 



(ii) MOLECULE TYPE: DNA 



636 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1157 
AAGAACGTTT CAGTTAAGGA AAT 
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2) INFORMATION FOR SEQ ID NO: 1158 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1158 
AAGAGGTAAT GTCTGTGGT 



2 ) INFORMATION FOR SEQ ID NO: 1159 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1159 
TGAAGGTTTG CCAGGTGA 



2) INFORMATION FOR SEQ ID NO: 1160 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1160 
CGTTTCTGTT AAAGAAATTA GAAG 



2) INFORMATION FOR SEQ ID NO: 1161 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

637 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1161 
TCCAGGTGAT AACGTTGG 



2) INFORMATION FOR SEQ ID NO: 1162 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1162 
CAAGTCCGTG GAAATGCA 



2 ) INFORMATION FOR SEQ ID NO: 1163 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1163 
GTTGGTTTCA ACGTTAAGAA C 



2) INFORMATION FOR SEQ ID NO: 1164 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1164 
GGTTTCAACG TCAAGAAC 



2) INFORMATION FOR SEQ ID NO: 1165 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1165 
GGTTTCAACG TGAAGAAC 



2) INFORMATION FOR SEQ ID NO: 1166 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1166 
ACGTTAAGAA TGTTTCTGTC AA 



2) INFORMATION FOR. SEQ ID NO: 1167 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1167 
GTTGGTTTCA ACGT 



2) INFORMATION FOR SEQ ID NO: 1168 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1168 
GAACAATTGG TTGAAGGTGT 
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2) INFORMATION FOR SEQ ID NO: 1169 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: SP-665 

(C) ACCESSION NUMBER: AF139883 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1169 



20 



ATGAACAAAC CAACGATTCT GCGCCTAATC AAGTATCTGA GCATTAGCTT 50 

CTTAAGCTTG GTTATCGCAG CCATTGTCTT AGGCGGAGGA GTTTTTTTCT 100 

ACTACGTTAG CAAGGCTCCT AGCCTATCCG AGAGTAAACT AGTTGCAACA 150 

ACTTCTAGTA AAATCTACGA CAATAAAAAT CAACTCATTG CTGACTTGGG 200 

25 TTCTGAACGC CGCGTCAATG CCCAAGCTAA TGATATTCCC ACAGATTTGG 250 

TTAAGGCAAT CGTTTCTATC GAAGACCATC GCTTCTTCGA CCACAGGGGG 300 

ATTGATTCCA TCCGTATCCT GGGAGCTTTC TTGCGCAATC TGCAAAGTAA 350 

TTCCCTCCAA GGTGGAT CAA CTCTCACCCA ACAGTTGATT AAGTTGACTT 400 

ACTTTTCAAC CTCGACTTCC GACCAGACTA TTTCTCGTAA GGC TCAGGAA 450 

30 GCTTGGTTAG CGATTCAGTT AGAACAAAAA GCAACCAAAC AGGAAATCTT 500 

GACCTACTAT ATAAATAAGG TCTACATGTC TAATGGCAAC TATGGAATGC 550 

AGACAGCAGC TCAAAACTAC TATGGTAAAG ACCTCAATAA TTTAAGTTTA 600 

CCTCAGTTAG CCTTGCTGGC TGGAATGCCT CAGGCACCAA ACCAATATGA 650 

CCCCTATTCA CATCCAGAAG C AG C CCAAGA CCGCCGAAAC TTGGTCTTAT 700 

35 CTGAAATGAA AAATCAAGGT TACATCTCTG CTGAACAGTA TGAGAAAGCA 750 

GTCAATACAC CAATTACTGA TGGACTACAA AGTCTCAAAT CAGCAAGTAA 800 

TTACCCTGCT TACATGGATA ATTACCTCAA GGAGGTCATC AATCAAGTAG 850 

AACAAGAAAC TGGCTATAAC CTTCTAACTA CTGGGATGGA TGTTTACACA 900 

AATGTAGACC AAGAAGC TCA AAAACATCTG TGGGATATCT ACAACTCCGA 950 

40 TCAATACGTC TCTTACCCTG ACGATGATTT GCAAGTCGCA TCTACGGTCG 1000 

TAGATGTTTC AAATGGTAAA GTCATCGCCC AACTTGGAGC TCGTCACCAA 1050 

GCAAGTAACG TTTCATTTGG TACCAACCAA GCTGTGGAAA CCAATCGTGA 1100 

CTGGGGTTCT GCTATGAAAC CAATCACCGA TTATGCACCT GCCATAGAAT 1150 

ACGGTGTTTA TGATTCCACT GCAACTATGG TTAATGATAT TCC TTATAAC 1200 

45 TATCCGGGAA CAAGCACACC TGTCTACAAC TGGGATAGAG CATATTTCGG 1250 

TAATATTACT CTGCAATATG CTCTTCAACA ATCACGAAAT GTCACAGCCG 1300 

TTGAGACTTT GAATAAGGTC GGTCTAGATA GAGCTAAAAC CTTCCTTAAT 13 50 

GGTCTTGGTA TCGACTATCC AAGCATGCAT TATGCAAACG CCATTTCAAG 1400 

TAATACAACA GAATCTAATA AACAATACGG AGCAAGTAGT GAAAAAATGG 1450 

50 CTGCTGCTTA TGCTGCCTTT GCAAATGGTG GCACTTACTA TAAACCAATG 1500 

TATATCCATA AAGTCGTCTT CAGTGATGGA AGTAAAAAAG AGTTCTCTAA 1550 

TGTCGGAACT CGTGCCATGA AGGAAACGAC AGCCTATATG ATGACCGACA 1600 

TGATGAAAAC AGTCTTGACT TATGGAACTG GGCGTGGAGC CTATCTTCCT 1650 

TGGCTTCCTC AAGCTGGTAA AACAGGAACC TCTAACTATA CAGATGAGGA 1700 

55 AGTTGAAAAC CACATCAAGA ACACTGGCTA TGTAGCTCCA GATGAAATGT 1750 

TTGTTGGTTA TACTCGTAAG TATTCTATGG CTGTATGGAC AGGTTATTCG 1800 

AATCGTTTAA CTCCTATCGT TGGAGATGGT TTCCTAGTTG CAGCTAAAGT 1850 

TTATCGCTCA ATGATAACGT ATCTATCAGA AGATACTCAT CCAGAAGACT 1900 

GGACGATGCC AGACGGACTT TTCAGAAACG GGGAATTTGT ATTCAAAAAT 1950 

60 GGAGCTCGCC CAATATGGAC TGAACCCTCT ACTCAACAAT CCTCAACAGC 2000 
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TGAAAGTTCA AGCTCATCAT CAGATAGTTC 
CCACTCCAAG CACAAATAAT AGTACGACTA 
CAACAATCAA ATACAACCCC TGATCAACAA 
ACAACCATAA 



PCT/CA00/01150 

AACTTCACAG TCTAGCTCAA 2050 
CCAATCCTAA CAATAATACG 2100 
AATCAGAATC CTCAACCAGC 2150 

2160 



2) INFORMATION FOR SEQ ID NO: 1170 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1170 
ACGAATTGGA CTACGCAATT 



2) INFORMATION FOR SEQ ID NO: 1171 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA , 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1171 
ACGAGGATGA TTTGATTGTC 



2 ) INFORMATION FOR SEQ ID NO: 1172 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1560 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
{ D ) TO POLOG Y : Linear 

(ii) MOLECULE. TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 64147 

(C) ACCESSION NUMBER: X13136 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1172 

GATCCTCTAA ATGATTCTCA GGTGGCTGTT ATTGCCTCTA TTTCAAAGGA 50 

GATGCCTGGC ATTAGTATTT CTACTTCTTG GGATAGAAAG GTTTTGGAAA 100 

CTTCCCTTTC TTCTATAGTT GGGAGTGTAT CCAGTGAAAA AGCTGGTCTC 150 

CCAGCGGAAG AAGCAGAAGC CTATCTTAAA AAAGGCTATT CTCTAAATGA 200 
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CCGTGTAGGA ACCTCCTATT TGGAAAAGCA ATATGAAGAG ACCTTACAAG 250 

GAAAACGCTC GGTAAAAGAA ATCCATCTGG ACAAATATGG CAACATGGAA 300 

AGCGTGGATA CAATTGAGGA AGGTAGTAAG GGAAACAATA TCAAGCTGAC 350 

CATTGATTTG GCCTTCCAAG ATAGCGTGGA TGCTTTGCTG AAAAGTTATT 400 

5 TCAATTCAGA GTTGGGAAAT GGTGGAGCCA AGTATTCTGA AGGTGTCTAT 450 

GCAGTCGCCC TTAACCCAAA AACAGGTGCT GTTTTGTCTA TGTCAGGAAT 500 

TAAACATGAC TTGAAAACAG GAGAGTTGAC GCCGGATTCC TTGGGAACGG 550 

TAACCAATGT CTTTGTCCCA GGTTCGGTTG TCAAAGCAGC GACCATCAGC 600 

TCTGGTTGGG AAAATGGAGT CTTGTCAGGA AATCAGACCT TGACAGACCA 650 

LO GTCCATTGTC TTTCAAGGTT CAGCTCCAAT TAATTCTTGG TATCCTGCCT 700 

TTTCTAGACC AATGCCGATT ACGGCGGTTC AGGCTCTAGA GTATTCATCC 750 

AATGCTTATA TGGTCCAAAC AGCCCTAGGT CTTATGGGGC AGACCTATCA 800 

ACCCAATATG TTTGTCGGCA CCAGCAATCT AGAGTCTGCT ATGGGGAAAT 850 

TGCGTTCAAC CTTTGGTGAA TATGGTTTGG GTTCTGCGAC CGGAATTGAC 900 

15 CTACCAGATG AATCTACTGG ATTTGTTCCC AAAGAGTATA GCTTTGCTAA 950 

TTTCATTACC AATGCCTTTG GGCAGTTTGA TAACTATACG CCGATGCAGT 1000 

TGGCTCAGTA TGTAGCAACT ATTGCAAATG ATGGTGTTCG TGTGGCTCCT 1050 

CGTATTGTTG AAGGCATTTA TGGTAATAAT GATAAGGGAG GACTGGGTGA 1100 

CTTGATTCAG CAAC TGCAAC CGACAGAGAT GAATAAGGTC AATATATCCG 1150 

20 ACTCCGATAT GAGCATCTTG CACCAAGGTT TTTATCAGGT TGCCCATGGT 1200 

ACTAGTGAAT TGACAACTGG ACGTGCCTTT TCAAATGGCG CCTTGGTATC 1250 

CATTAGCGGA AAAACAGGTA CAGCCGAAAG CTATGTGGCA GATGGTCAGC 1300 

AAGCAACCAA TACCAATGCG GTGGCCTATG CCCCATCTGA TAATCCCCAA 1350 

ATCGCTGTAG CTGTTGTCTT CCCTCATAAC ACCAACCTTA CAAATGGTGT 1400 

25 CGGACCTTCC ATTGCGCGCG ATATTATCAA CCTCTATAAC CAACATCATC 1450 

CAATGAATTA GAAAGGAACA TATGCTTTAT CCAACACCTA TTGCTAAGCT 1500 

AATTGACAGT TATTCGAAGT TACCGGGTAT CGGGATTAAG ACGGCTACCC 1550 

GTTTGGCCTT 1560 



30 



2) INFORMATION FOR SEQ ID NO: 1173 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 2007 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: CS109 

45 (C) ACCESSION NUMBER: 249096 

(xi) SEQUENCE DESCRIPTION-: SEQ ID NO: 1173 

GAGTCCCGAT TGCTGAGGAT GCAACCTCCT ATAATGTCTA TGCGGTCATT 50 

50 GATGAGAACT ATAAGTCAGC AACGGGTAAG ATTCTTTACG TAGAAAAAAC 100 

ACAATTTAAC AAGGTTGCAG AGGTCTTTCA TAAGTATCTG GACATGGAAG 150 

AATCCTATGT AAGAGAGCAA CTCTCGCAAC CTAATCTCAA GCAAGTTTCC 200 

TTTGGAGCAA AGGGAAATGG GATTACCTAT GCCAATATGA TGACTATCAA 250 

AAAAGAGTTG GAAACTGCAG AGGTCAAGGG GATTGATTTT ACAACCAGTC 300 

55 CCAATCGTAG TTATCCAAAC GGACAATTTG CTTCTAGTTT TATCGGTCTA 350 

GCTCAGCTCC ATGAAAATGA AGATGGCAGC AAGAGCTTGC TGGGAACTTC 400 

TGGAATGGAG AGTTCCTTGA ACAGTATTCT TGCAGGGACA GACGGCATTA 450 

TTACCTATGA AAAGGATCGT CTGGGCAATA TTGTACCCGG AACAGAACTG 500 

GTATCGCAAC AAACTGTGGA TGGCAAGGAT GTTTATACAA CATTGTCTAG 550 

60 TCCGCTACAA TCTTTCATGG AAACTCAGAT GGATGCCTTT CTAGAAAAAG 600 
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TAAAAGGTAA GTATATGACC GCGACCTTGG TCAGTGCAAA GACCGGTGAA 650 

ATTCTCGCTA CCACCCAACG ACCTACCTTT AATGCAGATA CTAAAGAAGG 700 

AATCACTGAG GACTTTGTTT GGCGTGATAT TCTTTATCAA AGTAACTATG 750 

AACCAGGATC AGCCTTTAAG GTCATGATGT TAGCTTCTTC TATTGATAAT 800 

5 AATACCTTCC CAAGTGGAGA ATACTTCAAT AGCAGTGAAT TCAAAATAGC 850 

GGATGCGACG ACTCGAGATT GGGATGTTAA TGAGGGTTTG ACTACTGGTG 900 

GGATGATGAC TTTCTCACAA GGTTTCGCTC ACTCCAGTAA TGTTGGAACG 950 

AGTCTACTTG AACAAAAAAT GGGAGATGCT ACTTGGTTGG ATTATCTAAA 1000 ■ 

ACGCTTTAAA TTTGGGGTTC CAACTCGCTT TGGCTTGACA GATGAATACG 1050 

10 CTGGTCAACT TCCAGCTGAT AATATTGTTA GTATTGCTCA AAGCTCATTT 1100 

GGGCAAGGAA TTTCAGTGAC ACAAACACAA ATGCTTCGTG CCTTTACAGC 1150 

TATTGCTAAT GATGGAGTTA TGCTGGAGCC AAAATTTATA AGTGCTATTT 1200 

ATGATACTAA CAATCAGTCT GTACGTAAGT CACAAAAAGA AATAGTAGGA 1250 

AATCCTGTTT CCAAAGAGGC AGCAAGCACA ACTCGAAATC ACATGATCTT 1300 

15 AGTTGGGACG GACCCTCTAT ATGGAACTAT GTATAATCAC TACACAGGAA 1350 

AGCCAATTAT AACAGTTCCT GGACAAAATG TAGCAGTTAA ATCCGGTACG 1400 

GCTCAAATCG CTGATGAGAA AAATGGAGGA TACTTGGTTG GTTCTACCAA 1450 

TTATATTTTC TCAGTTGTGA CTATGAATCC TGCTGAAAAT CCTGATTTTA 1500 

TCTTGTATGT AACGGTTCAA CAGCCTGAGC ATTATTCAGG TATCCAGTTG 1550 

20 GGAGAATTTG CCACCCCAAT CTTGGAGCGG GCTTCAGCTA TGAAAGAATC 1600 

TCTCAATCTT CAATCTCCAG CCAAAAATTT AGATAAAGTT ACGACAGAAT 1650 

CTTCTTATGC AATGCCTAGC ATCAAGGATA TTTCACCTGG TGAGTTGGCG 1700 

GAAGCCTTAC GCCGAAATAT TGTGCAACCA ATCGTTGTAG GTACTGGAAC 1750 

AAAGATTAAA GAGACTTCTG TAGAAGAAGG GACCAATCTT GCACCAAACC 1800 

25 AACAAGTTCT CCTTTTATCG GATAAGGTAG AAGAAATTCC AGACATGTAT 1850 

AGCTGGAAAA AAGAGACTGC CGAGACCTTT GCTAAATGGT TGGATATTGA 1900 

ACTGGAATTT GAAGGTTCAG GTTCCGTTGT TCAGAAGCAA GATGTTCGGA 1950 

CTAATACAGC TATCAAAAAC ATTAAAAAAA TTAAATTAAC TTTAGGAGAC 2000 

TAATATG 2007 

30 

2 ) INFORMATION FOR SEQ ID NO: 1174 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1174 
45 GAACGTGGTG AAGTTCGC 18 



50 



2) INFORMATION FOR SEQ ID NO: 1175 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
55 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1175 

60 
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GTTACTGGTG TAGAAATGTT C 21 



5 2) INFORMATION FOR SEQ ID NO: 1176 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1176 

TACTGGTGTA GAAATGTTC 19 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1177 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
25. (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1177 
GCTCAACAAG TTCCAGATTA . . 20 



2) INFORMATION FOR SEQ ID NO: 1178 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 2456 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: NCTC8325 

50 (C) ACCESSION NUMBER: X52593 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1178 

ATGAACTGAT TATACTTAAC ATTAAAAAAG ATGATAACAC CTTCTACACC 50 

55 TCCATATCAC AAAAAATTAT AACATTATTT TGACATAAAT ACTACATTTG 100 

TAATATACTA CAAATGTAGT CTTATATAAG GAGGATATTG ATGAAAAAGA 150 

TAAAAATTGT TCCACTTATT TTAATAGTTG TAGTTGTCGG GTTTGGTATA 200 

TATTTTTATG CTTCAAAAGA TAAAGAAATT AATAAT AC TA TTGATGCAAT 250 

TGAAGATAAA AATTTCAAAC AAGTTTATAA AGATAGCAGT TATATTTCTA 300 

60 AAAGCGATAA TGGTGAAGTA GAAATGACTG AACGTCCGAT AAAAATATAT 350 
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AATAGTTTAG GCGTTAAAGA TATAAACATT CAGGATCGTA AAATAAAAAA 4 00 

AGTATCTAAA AATAAAAAAC GAGTAGATGC TCAATATAAA ATTAAAACAA 450 

ACTACGGTAA CATTGATCGC AACGTTCAAT TTAATTTTGT TAAAGAAGAT 500 

GGTATGTGGA AGTTAGATTG GGATCATAGC GTCATTATTC CAGGAATGCA 550 

5 GAAAGACCAA AGCATACATA TTGAAAATTT AAAATCAGAA CGTGGTAAAA 600 

TTTTAGACCG AAACAATGTG GAATTGGCCA ATACAGGAAC ACATATGAGA 650 

TTAGGCATCG TTCCAAAGAA TGTATCTAAA AAAGATTATA AAGCAATCGC 700 

TAAAGAACTA AGTATTTCTG AAGACTATAT CAACAACAAA TGGATCAAAA 750 

TTGGGTACAA GATGATACCT TCGTTCCACT TTAAAACCGT TAAAAAAATG 800 

LO GATGAATATT TAAGTGATTT CGCAAAAAAA TTTCATCTTA CAACTAATGA 850 

AACAGAAAGT CGTAACTATC CTCTAGAAAA AGCGACTTCA CATCTATTAG 900 

GTTATGTTGG TCCCATTAAC TCTGAAGAAT TAAAACAAAA AGAATATAAA . 950 

GGCTATAAAG ATGATGCAGT TATTGGTAAA AAGGGACTCG AAAAACTTTA 1000 

CGATAAAAAG CTCCAACATG AAGATGGCTA TCGTGTCACA ATCGTTGACG 1050 

L5 ATAATAGCAA TACAATCGCA CATACATTAA TAGAGAAAAA GAAAAAAGAT 1100 

GGCAAAGATA TTCAACTAAC TATTGATGCT AAAGTTCAAA AGAGTATTTA 1150 

TAACAACATG AAAAATGATT ATGGCTCAGG TACTGCTATC CACCCTCAAA 12 00 

CAGGTGAATT ATTAGCACTT GTAAGCACAC CTTCATATGA CGTCTATCCA 1250 

TTTATGTATG GCATGAGTAA CGAAGAATAT AATAAATTAA CCGAAGATAA 13 00 

20 AAAAGAACCT CTGCTCAACA AGTTCCAGAT TACAACTTCA CCAGGTTCAA 1350 

CTCAAAAAAT ATTAACAGCA ATGATTGGGT TAAATAACAA AACATTAGAC 1400 

GATAAAACAA GTTATAAAAT CGATGGTAAA GGTTGGCAAA AAGATAAATC 1450 

TTGGGGTGGT TACAACGTTA CAAGATATGA AGTGGTAAAT GGTAATATCG 1500 

ACTTAAAACA AGCAATAGAA TCATCAGATA ACATTTTCTT TGCTAGAGTA 1550 

25 GCACTCGAAT TAGGCAGTAA GAAATTTGAA AAAGGCATGA AAAAACTAGG 1600 

TGTTGGTGAA GATATACCAA GTGATTATCC ATTTTATAAT GCTCAAATTT 1650 

CAAACAAAAA TTTAGATAAT GAAATATTAT TAGCTGATTC AGGTTACGGA 1700 

CAAGGTGAAA TACTGATTAA CCCAGTACAG ATCCTTTCAA TCTATAGCGC 1750 

ATTAGAAAAT AATGGCAATA TTAACGCACC TCACTTATTA AAAGACACGA 1800 

30 AAAACAAAGT TTGGAAGAAA AATATTATTT CCAAAGAAAA TATCAATCTA 1850 

TTAAATGATG GTATGCAACA AGTCGTAAAT AAAACACATA AAGAAGATAT 1900 

TTATAGATCT TATGCAAACT TAATTGGCAA ATCCGGTACT GCAGAACTCA 1950 

AAATGAAACA AGGAGAAAGT GGCAGACAAA TTGGGTGGTT TATATCATAT 2000 

GATAAAGATA ATCCAAACAT GATGATGGCT ATTAATGTTA AAGATGTACA 2050 

35 AGATAAAGGA ATGGCTAGCT ACAATGCCAA AATCTCAGGT AAAGTGTATG 2100 

ATGAGCTATA TGAGAACGGT AATAAAAAAT ACGATATAGA TGAATAACAA 2150 

AACAGTGAAG CAATCCGTAA CGATGGTTGC TTCACTGTTT TATTATGAAT 22 00 

TATTAATAAG TGCTGTTACT TCTCCCTTAA ATACAATTTC TTCATTTTCA 2250 

TTGTATGTTG AAAGTGACAC TGTAACGAGT CCATTTTCTT TTTTTATGGA 23 00 

40 TTTCTTATTT GTAATTTCAG CGATAACGTA CAATGTATTA CCTGGTATAC 2350 

AGTTTAATAA ATTTAACGTT ATTCATTTGT GTTCCTGCTA CAACTTCTTC 2400 

TCCGTATTTA CCTTCTTCTA CCCATAATTT AAATGATATT GAAAGTGTAT 2450 

GCATGC 2456 



45 



2) INFORMATION FOR SEQ ID NO: 1179 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

55 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1179 
ATTTGGTGAC GGGTGACTTT 20 



60 



645 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



10 



20 



2) INFORMATION FOR SEQ ID NO: 1180 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1180 
15 TCCACCGTTG CCAATCGCA 19 



2) INFORMATION FOR SEQ ID NO: 1181 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1181 

30 ' 

AGCAGCTTAC TAGATGCCGT < 20 



35 2) INFORMATION FOR SEQ ID NO: 1182 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1182 

AACTGCAAGA GATCCTTTGG 20 



50 



60 



2) INFORMATION FOR SEQ ID NO: 1183 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2535 bases 
55 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

64 6 
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WO 01/23604 PCT/CA00/01 150 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 175 

(C) ACCESSION NUMBER: M18729 

5 

(xii) SEQUENCE DESCRIPTION: SEQ ID NO: 1183 

ATGGCGATAG AAAAGCTATC ACCCGGCATG CAACAGTATG TGGATATTAA 50 

10 AAAGCAATAT CCAGATGCTT TTTTGCTCTT TCGGATGGGT GATTTTTATG 100 

AATTATTTTA TGAGGATGCG GTCAATGCTG CGCAGATTCT GGAAATTTCC 150 

TTAACGAGTC GCAACAAGAA TGCCGACAAT CCGATCCCTA TGGCGGGTGT 200 

TCCCTATCAT TCTGCCCAAC AGTATATCGA TGTCTTGATT GAGCAGGGTT 250 

ATAAGGTGGC TATCGCAGAG CAGATGGAAG ATCCTAAACA AGCAGTTGGG 300 

15 GTTGTTAAAC GAGAGGTTGT TCAGGTCATT ACGCCAGGGA CAGTGGTCGA 350 

TAGCAGTAAG CCGGACAGTC AGAATAATTT TTTGGTTTCC ATAGACCGCG 400 

AAGGCAATCA ATTTGGCCTA GCTTATATGG ATTTGGTGAC GGGTGACTTT 450 

TATGTGACAG GTCTTTTGGA TTTCACGCTG GTTTGTGGGG AAATCCGTAA 500 

CCTCAAGGCT CGAGAAGTGG TGTTGGGTTA TGACTTGTCT GAGGAAGAAG 550 

20 AACAAATCCT CAGCCGCCAG ATGAAT CTGG TACTCTCTTA TGAAAAAGAA 600 

AGCTTTGAAG ACCTTCATTT ATTGGATTTG CGATTGGCAA CGGTGGAGCA 650 

AACGGCATCT AGTAAGCTGC TCCAGTATGT TCATCGGACT CAGATGAGGG 700 

AATTGAACCA CCTCAAACCT GTTATCCGCT ACGAAATTAA GGATTTCTTG 750 

CAGATGGATT ATGCGACCAA GGCTAGTCTG GATTTGGTTG AGAATGCTCG 800 

25 CTCAGGTAAG AAACAAGGCA GTCTTTTCTG GCTTTTGGAT GAAACCAAAA 850 

CGGCTATGGG GATGCGTCTC TTGCGTTCTT GGATTCATCG CCCCTTGATT 900 

GATAAGGAAC GAATCGTCCA ACGTCAAGAA GTAGTGCAGG TCTTTCTCGA 950 

CCATTTCTTT GAGCGTAGTG ACTTGACAGA CAGTCTCAAG GGTGTTTATG 1000 

ACATTGAGCG CTTGGCTAGT CGTGTTTCTT TTGGCAAAAC CAATCCAAAG 1050 

30 GATCTCTTGC AGTTGGCGAC TACCTTGTCT AGTGTGCCAC GGATTCGTGC 1100 

GATTTTAGAA GGGATGGAGC AACCTACTCT AGCCTATCTC ATCGCACAAC 1150 

TGGATGCAAT CCCTGAGTTG GAGAGTTTGA TTAGCGCAGC GATTGCTCCT 1200 

GAAGCTCCTC ATGTGATTAC AGATGGGGGA ATTATCCGGA CTGGATTTGA 1250 
TGAGACTTTA GACAAGTATC GTTGCGTTCT CAGAGAAGGG ACTAGCTGGA . 13 00 

35 TTGCTGAGAT TGAGGCTAAG GAGCGAGAAA ACTCTGGTAT CAGCACGCTC 1350 

AAGATTGACT ACAATAAAAA GGATGGCTAC TATTTTCATG TGACCAATTC 1400 

GCAACTGGGA AATGTGCCAG CCCACTTTTT CCGCAAGGCG ACGCTGAAAA 1450 

ACTCAGAACG CTTTGGAACC GAAGAATTAG CCCGTATCGA GGGAGATATG 1500 

CTTGAGGCGC GTGAGAAGTC AGCCAACCTC GAATACGAAA TATTTATGCG 1550 

40 CATTCGTGAA GAGGTCGGCA AGTACATCCA GCGTTTACAA GCTCTAGCCC 1600 

AAGGAATTGC GACGGTTGAT GTCTTACAGA GTCTGGCGGT TGTGGCTGAA 1650 

ACCCAGCATT TGATTCGACC TGAGTTTGGT GACGATTCAC AAATTGATAT 1700 

CCGGAAAGGG CGCCATGCTG TCGTTGAAAA GGTTATGGGG GCTCAGACCT 1750 

ATATTCCAAA TACGATTCAG ATGGCAGAAG ATACCAGTAT TCAATTGGTT 1800 

4 5 ACAGGGCCAA ACATGAGTGG GAAGTCTACC TATATGCGTC AGTTAGCCAT 1850 

GACGGCGGTT ATGGCCCAGC TGGGTTCCTA TGTTCCTGCT GAAAGCGCCC 1900 

ATTTACCGAT TTTTGATGCG ATTTTTACCC GTATCGGAGC AGCAGATGAC 1950 

TTGGTTTCGG GTCAGTCAAC CTTTATGGTG GAGATGATGG AGGCCAATAA 2000 

TGCCATTTCG CATGCGACCA AGAACTCTCT CATTCTCTTT GATGAATTGG 2050 

50 GACGTGGAAC TGCAACTTAT GACGGGATGG CTCTTGCTCA GTCCATCATC 2100 

GAATATATCC ATGAGCACAT CGGAGCTAAG ACCCTCTTTG CGACCCACTA 2150 

CCATGAGTTG ACTAGTCTGG AGTCTAGTTT ACAACACTTG GTCAATGTCC 2200 

ACGTGGCAAC TTTGGAGCAG GATGGGCAGG TCACCTTCCT TCACAAGATT 2250 

GAACCGGGAC CAGCTGATAA ATCCTACGGT ATCCATGTTG CCAAGATTGC 2300 

55 TGGCTTGCCA GCAGACCTTT TAGCAAGGGC GGATAAGATT TTGACTCAGC 2350 

TAGAGAATCA AGGAACAGAG AGTCCTCCTC CCATGAGACA AACTAGTGCT 2400 

GTCACTGAAC AGATTTCACT CTTTGATAGG GCAGAAGAGC ATCCTATCCT 2450 

AGCAGAATTA GCTAAACTGG ATGTGTATAA TATGACACCT ATGCAGGTTA 2500 

TGAATGTCTT AGTAGAGTTA AAACAGAAAC TATAA 2535 

60 
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2) INFORMATION FOR SEQ ID NO: 1184 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 623 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-05 

(xiii) SEQUENCE DESCRIPTION: SEQ ID NO: 1184 

TGACGGGTGA CTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 

GGGGAAATCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 100 

GTCTGAGGAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT 150 

CTTATGAAAA AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 200 

GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCAGT ATGTTCATCG 250 

GACTCAGATG AGGGAATTGA ACCACCTCAA ACCTGTTATC CGATACGAAA 3 00 

TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 350 

GTTGAGAATG CTCGCTCAGG TAAGAAACAA GGCAGTCTTT TCTGGCTTTT 4 00 

GGATGAAACC AAAACGGCTA TGGGGATGCG TCTCTTGCGT TCTTGGATTC 450 

ATCGCCCCTT GATTGATAAG GAACGAATCG TCCAACGTCA AGAAGTAGTG 500 

CAGGTCTTTC TCGACCATTT CTTTGAGCGT AGTGACTTGA CAGACAGTCT 550 

CAAGGGTGTT TATGACATTG AGCGCTTGGC TAGTCGTGTT TCTTTTGGCA 600 

AAACCAATCC AAAGGATCTC TTG ' " 623 



2) INFORMATION FOR SEQ ID NO: 1185 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 621 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-06 

(xiv) SEQUENCE DESCRIPTION: SEQ ID NO: 1185 

TGACGGGTGA CTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 

GGGGAAATCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 100 

GTCTGAGGAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT 150 

CTTATGAAAA AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 200 

GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCAGT ATGTTCATCG 250 

GACTCAGATG AGGGAATTGA ACCACCTCAA ACCTGTTATC CGATACGAAA 3 00 

TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 3 50 

GTTGAGAATG CTCGCTCAGG TAAGAAACAA GGCAGTCTTT TCTGGCTTTT 400 

GGATGAAACC AAAACGGCTA TGGGGATGCG TCTCTTGCGT TCTTGGATTC 450 

ATCGCCCCTT GATTGATAAG GAACGAATCG TCCAACGTCA AGAAGTAGTG 500 

CAGGTCTTTC TCGACCATTT CTTTGAGCGT AGTGACTTGA CAGACAGTCT 550 
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CAAGGGTGTT TATGACATTG AGCGCTTGGC TAGTCGTGTT TCTTTTGGCA 
AAACCAATCC AAAGGATCTC T 



600 
621 



LO 



15 



20 



25 



30 



35 



40 



45 



50 



55 



2) INFORMATION FOR SEQ ID NO: 1186 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-11 

(xv) SEQUENCE DESCRIPTION: SEQ ID NO: 1186 

TGACGGGTGA CTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 

GGGGAAATCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 100 

GTCTGAGGAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT 150 

CTTATGAAAA AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 200 

GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCAGT ATGTTCATCG 250 

GACTCAGATG AGGGAATTGA ACCACCTCAA ACCTGTTATC CGCTACGAAA 300 

TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 350 

GTTGAGAATG CTCGCTCAGG TAAGAAACAA GGCAGTCTTT TCTGGCTTTT 400 

GGATGAAACC AAAACGGCTA TGGGGATGCG TCTCTTGCGT TCTTGGATTC 450 

ATCGCCCCTT GATTGATAAG GAACGAATCG TCCAACGTCA AGAAGTAGTG 500 

CAGGTCTTTC TCGACCATTT CTTTGAGCGT AGTGACTTGA CAGACAGTCT 550 

CAAGGGTGTT TATGACATTG AGCGCTTGGC TAGTCGTGTT TCTTTTGGCA 600 

AAACCAATCC AAAGGATCTC TT 622 



2) INFORMATION FOR SEQ ID NO: 1187 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-55 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO : 1187 

TGACGGGTGA CTTTTATGTG ACAGGTCTTT TGGATTTCAC GCTGGTTTGT 50 

GGGGAAATCC GTAACCTCAA GGCTCGAGAA GTGGTGTTGG GTTATGACTT 100 

GTCTGAGGAA GAAGAACAAA TCCTCAGCCG CCAGATGAAT CTGGTACTCT 150 

CTTATGAAAA AGAAAGCTTT GAAGACCTTC ATTTATTGGA TTTGCGATTG 200 

GCAACGGTGG AGCAAACGGC ATCTAGTAAG CTGCTCCGGT ATGTTCATCG 250 

GACTCAGATG AGGGAATTGA ACCACCTCAA ACCTGTTATC CGCTACGAAA 300 

TTAAGGATTT CTTGCAGATG GATTATGCGA CCAAGGCTAG TCTGGATTTG 350 
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GTTGAGAATG CTCGCTCAGG TAAGAAACAA 
GGATGAAACC AAAACGGCTA TGGGGATGCG 
ATCGCCCCTT GATTGATAAG GAACGAATCG 
CAGGTCTTTC TCGACCATTT CTTTGAGCGT 
5 CAAGGGTGTT TATGACATTG AGCGCTTGGC 
AAACCAATCC AAAGGATCTC TT 



PCT/CAOO/01150 

GGCAGTCTTT TCTGGCTTTT 400 
TCTCTTGCGT TCTTGGATTC 450 
TCCAACGTCA AGAAGTAGTG 500 
AGTGACTTGA CAGACAGTCT 550 
TAGTCGTGTT TCTTTTGGCA 600 

622 



10 2) INFORMATION FOR SEQ ID NO: 1188 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 
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(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1188 



GGGTGACTTT TATGTAACGG GGCTATTGGA TTTCACGTTG GTTTGTGGGG 50 

AAATTCGCAA TCTCAAGGCT AGAGAAGTGG TGCTGGGTTA TGACTTGTCT 100 

GAGGAAGAAG AACAAATCCT CAGTCGTCAG ATGAATCTGG TGCTTTCTTA 150 

TGAGAAGGAA GGCTTTGAGG ACCTTCATTT ACTGGATCCA CGACTGGCAG 200 

30 CTGTGGAGCA AGCGGCAGCT AGTAAGCTCC TCCAGTATGT TCACCGGACC 250 

CAGATGCGGG AATTGAACCA CCTCAAACCA GTTATCCGCT^ ATGAAATCAA 300 

AGATTTCTTA CAGATGGACT ATGCGACCAA GGCTAGTCTG GATTTGGTTG 350 

AGAATGCCCG TTCAGGCAAG AAGCAAGGCA GTCTTTTCTG" GCTTTTAGAT 400 

GAAACCAAGA CGGCTATGGG AATGCGTCTC TTGCGTTCTT GGATTCATCG 450 

35 TCCTTTGATT GATAAGGAGC GAATCGTCCA GCGTCAAGAG GTGGTGCAGG 500 

TCTTTCTTGA CCACTTCTTT GAGCGTAGTG ATTTAACGGA CAGTCTTAAG 550 

GGTGTTTATG ATATCGAACG CTTGGCTAGT CGGGTTTCTT TTGGCAAGA 599 



2) INFORMATION FOR SEQ ID NO: 1189 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 624 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 

55 (x) SEQUENCE DESCRIPTION: SEQ ID NO: 1189 

GGTGACGGGT GACTTTTATG TGACAGGTCT TTTGGATTTT ACGCTGGTTT 50 

GTGGGGAAAT CCGCAATCTC AAGGCTCGAG AAGTGGTGCT GGGTTATGAC 100 

TTGTCTGAGG AAGAAGAACA GATCCTTAGT CGTCAGATGA ATCTGGTACT 150 

60 TTCCTATGAA AAAGAAGGCT TTGAAGACCT TCATTTACTG GATTCACGAT 200 
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TGGCAGCTGT GGAGCAAGCG GCATCTAGTA 
CGGACTCAGA TGAGGGAATT GAACCACCTC 
AATCAAAGAT TTTTTGCAGA TGGATTATGC 
TGGTTGAGAA TGCCCGTTCA GGCAAGAAGC 
TTGGATGAAA CCAAAACAGC TATGGGAATG 
TCATCGCCCC CTGATTGATA AGGAACGAAT 
TGCAGGTCTT TCTCGACCAT TTCTTTGAGC 
CTCAAGGGTG TTTATGACAT TGAGCGCTTG 
CAAAACCAAT CCAAAGGATC TCTT 
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GGTCTTGGAT 
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TGTCCAACGC 


CAAGAAGTTG 


500 


GTAGTGATTT 


GACAGACAGT 


550 


GCTAGTCGTG 


TTTCTTTTGG 


600 
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2) INFORMATION FOR SEQ ID NO: 1190 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: LSPQ 2583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1190 

TGACGGGTGA CTTTCAGGTG ACTAGTTTAG AGGACTTTGT CTTGGTCTGC 50 

GGGGAAATCC GCAATTTGAA AGCTAGGGAA GTGGTGCTGG GCTATGCCTT 100 

GCCAGAAGCT GAGGAGCAGG TTTTGGCTGG ACAGATGAAC CTTTTACTGT 150 

CCTATGTGGA GAAGGTTTTG GAGGATGTTC AGCTGCTGGG CGAGGAGCTG 200 

TCTCCTATGG AGCGTCAGGC AGCAGGGAAA CTGCTGGAGT ATGTGCACCG 250 

GACCCAGATG AGGGAGCTCA GCCATTTGAA GAAGGCTCAG CATTATGAAA 300 

TCAAGGACTT CCTGCAAATG GACTATGCCA CCAAGGCGAG TCTGGATTTG 350 

ACAGAAAATG CTCGCTCGGG CAAGAAGCAC GGCAGTCTTT ATTGGCTGAT 400 

GGACGAGACT AAGACGGCCA TGGGCGGCCG CATGCTGCGC TCTTGGATCC 450 

AGCGTCCGCT GATTGATGAA GCGCGAATTA GCCAGCGACA GAATGTCGTT 500 

GAGGTTTTTC TGGATCATTT CTTTGAGCGG AGTGATTTGA CGGAGAGCCT 550 

CAAGGGGGTC TATGATATCG AGCGGCTGGC TAGTCGGGTG TCTTTTGGC 599 



2) INFORMATION FOR SEQ ID NO: 1191 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 903 

(ix) SEQUENCE DESCRIPTION: SEQ ID NO: 1191 

TGACGGGTGA CTTTCAGGTG ACTAGTTTAG AGGACTTTGC CCTGGTCTGC 50 
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GGGGAAATCC GTAATTTGAA GGCTAGGGAA GTGGTGCTGG GCTATGCTTT 100 

GCCAGAAGCT GAGGAGCAGG TCTTGGCTGG ACAGATGAAT CTTTTGCTGT 150 

CCTATGTACA GACGGCCTTG GACGATGTCC AGCTGCTGGG CGAGGAACTG 200 

TCTCCTATGG AGCGTCAGGC AGCGGGGAAA TTGCTAGAGT ATGTGCACCG 250 

5 GACCCAGATG AGGGAGCTCA GCCATTTGAA GAAGGCCCAG CATTATGAAA 3 00 

TCAAGGACTT TCTGCAAATG GATTATGCTA CCAAGGCGAG TCTGGATTTG 350 

ACAGAAAATG CTCGCTCGGG TAAGAAACAC GGCAGTCTTT ATTGGCTGAT 400 

GGACGAGACC AAGACGGCCA TGGGCGGCCG TATGCTGCGC TCTTGGATCC 450 

AGCGTCCGTT GATTGATGAA GTGCGAATTA GCCAGCGGCA GAATGTCGTC 500 

10 GAGGTTTTTC TGGAACATTT CTTTGAGCGG AGTGATTTGA CGGAGAGCCT 550 

CAAGGGAGTC TATGATATCG AGCGGCTGGC TAGTCGGGTG TCTTTTGGCA 600 

AGACCAATCC AAAGGAT CTC TT 622 
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2) INFORMATION FOR SEQ ID NO: 1192 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1192 
GGTAAAACAG GAACCTCTAA CT 22 



2) INFORMATION FOR SEQ ID. NO: 1193 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 22 bases 

(B) , TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1193 
GGTAAGACAG GTACTTCTAA CT 22 

45 

2) INFORMATION FOR SEQ ID NO: 1194 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1194 
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24 



5 2) INFORMATION FOR SEQ ID NO: 1195 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

( D ) TOPOLOGY : Li near 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID .NO: 1195 

CATTTCAAGT AACACAACTG AATC 24 
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2) INFORMATION FOR SEQ ID NO: 1196 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1196 < 
GCCATTTCAA GTAATACAAC AGAA . '24 



2) INFORMATION FOR SEQ ID NO: 1197 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1197 
CAAACGCCAT TTCAAGTAAT ACAAC 25 

50 



2 ) INFORMATION FOR SEQ ID NO: 1198 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 81 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: ATCC 43867 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1198 

AACGGGCGTC TCGATAGAAA AACACGTGAA AATCCCAATG ATTATAAACA 50 

ATCAATATAC GATTTTGCTG AAGCTGTAAC AAAAGGTATT AAGGAACAAA 100 

CAAATAAAAA TTAATAGGCA ACTTAACCAG AATCGTTAAA ACTATATGAC 150 

GATTCTGGTT TTTTAAATTC AAAAAGTTTT CTAAAAAATT TACTTGCTTC 200 

TTTAAAGTAT AGGTATGAAA TACAATTGAT TAAAATAGTA AAGGAAATGA 250 

ATCATGAAAC AATTAACTAA GCCTTTATAC TTTTACCTAT TACTTTTTAT 300 

TACAACAACG CTGATTGGCG CGTTACTATT ATATTTGCCA ATCACAKGTA 350 

AACATCCTAT TGATTTTGTG GACGCCCGTT A 3 81 



2) INFORMATION FOR SEQ ID NO: 1199 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1199 
GTATTAAAGA AGATATC CAA AAAGC 



2) INFORMATION FOR SEQ ID NO: 1200 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 0 0 
TCAAAGAAGA AACTAAAAAA GCTGT 



2) INFORMATION FOR SEQ ID NO: 1201 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1201 
AACGTAGGTG TCCTTCTTC 



2 ) INFORMATION FOR SEQ ID NO: 12 02 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1202 
GTGTTGAAAT GTTCCGTAAA CA 



2) INFORMATION FOR SEQ ID NO: 1203 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1203 
GGIGARMGIG GIAAYGARAT G 



2) INFORMATION FOR SEQ ID NO: 1204 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 04 
GCIAAYAACI TCIWMYATGC C 



2) INFORMATION FOR SEQ ID NO: 1205 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1205 
AAYACITCIA WYATGCCIGT 



2) INFORMATION FOR SEQ ID NO: 1206 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
' (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1206 
CKISRIGTIG ARTCIGCCA 



2) INFORMATION FOR SEQ ID NO: 1207 

(i) SEQUENCE CHARACTERISTICS :~ 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1207 
CCITCITCWC CIGGCATYTC 



2 ) INFORMATION FOR SEQ ID NO: 1208 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1208 
TCAAAAAGTT TTCTAAAAAA TTTAC 



2) INFORMATION FOR SEQ ID NO: 1209 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 09 
ACGGGCGTCC ACAAAATCAA TAGGA 



2) INFORMATION FOR SEQ ID NO: 1210 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1210 
ACCAGCTTGC CCAATACAAA GG 



2) INFORMATION FOR SEQ ID NO: 1211 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1211 
ATTCTTGTAA CAGGCTTTGA TCCC 



' 2 ) INFORMATION FOR SEQ ID NO: 1212 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1212 
CCICCIRGIG GIGAIACIGC WCC 
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2) INFORMATION FOR SEQ ID NO: 1213 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1213 

15 

AARGGIGGIA CIGCIGCIAT HCCIGG 26 



20 2) INFORMATION FOR SEQ ID NO: 1214 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1214 

GGTAAAACAG GTACCTCTAA CTA 23 



35 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1215 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1337 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 

(B) STRAIN: D471 

(C) ACCESSION NUMBER: X65717 

(xii) SEQUENCE DESCRIPTION: SEQ ID NO : 1215 



AACAAAATAA AAGAACTTAC CTATTTTCCA TCCAAAATGT TTAGCAATCA 50 

TCATCTGCAA GGCAACGTAT TGCATGGCAT TGATGTGATG AGCAACTAAT 100 

55 ATGTCATTAG AACGTTGCGT CAAACTAGCA TCTAAATAAA GATCGAAATG 150 

CAGTTATCAA AAATGCAAGC TCCTATCGGC CCTTGTTTTA ATTATTACTC 200 

ACATTGCCTT AATGTATTTA CTTGCTTATT ATTAACTTTT TTGCTAAGTT 250 

AGTAGCGTCA GTTATTCATT GAAAGGACAT TATTATGAAA ATTCTTGTAA 300 

CAGGCTTTGA TCCCTTTGGC GGCGAAGCTA TTAATCCTGC CCTTGAAGCT 350 

60 ATCAAGAAAT TGCCAGCAAC CATTCATGGA GCAGAAATCA AATGTATTGA 400 
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AGTTCCAACG GTTTTTCAAA AATCTGCCGA TGTGCTCCAG CAGCATATCG 4 50 

AAAGCTTTCA ACCTGATGCA GTCCTTTGTA TTGGGCAAGC TGGTGGCCGG 500 

ACTGGACTAA CGCCAGAACG CGTTGCCATT AATCAAGACG ATGCTCGCAT 550 

TCCTGATAAC GAAGGGAATC AGCCTATTGA TACACCTATT CGTGCAGATG 600 

5 GTAAAGCAGC TTATTTTTCA ACCTTGCCAA TCAAAGCGAT GGTTGCTGCC 650 

ATTCATCAGG CTGGGCTTCC TGCTTCTGTT TCTAATACAG CTGGTACCTT 700 

TGTTTGCAAT CATTTGATGT ATCAAGCCCT TTACTTAGTG GATAAATATT 750 

GTCCAAATGC CAAAGCTGGG TTTATGCATA TTCCCTTTAT GATGGAACAG 800 

GTTGTTGATA AACCTAATAC AGCTGCCATG AACCTCGATG ATATTACAAG 850 

10 AGGAATTGAG GCTGCTATTT TTGCCATTGT CGATTTCAAA GATCGTTCCG 900 

ATTTAAAACG TGTAGGGGGC GCTACTCACT GACTGTGACG CTACTAAACC 950 

TATTTTAAAA AAACAGAGAT ATGAACTAAC TCTGTTTTTT TTGTGCTAAA 1000 

AATGAAAGAC CTAGGGAAAC TTTTCATCGG TCTTTCTCAA TTGTCATCTT 1050 

AATCTAATAC TACTTCTAAC ATCAGCGGGT ATAGTTTGCC AGTAATTAAG 1100 

15 AAACGTTGTT GATCTAAATG AGCAAfCCCA TTCAAAACAT TAAGGTCAGG 1150 

GTAATGGGAC TTATCAAGAT TTAAGGCTTT TAACAAAGGA CTAATATCAT 1200 

AGGTGGCTAC CACCTTTCCA GAATCAGGTT GGAGTTTGAC AATAGTATTG 1250 

GTTTGCCAAA TATTGGCATA GAGATAACCA TCTACATACT CTAATTCGTT 1300 

AAGCATTGAG ATAGGGACAC TTTCTATAGC AACTAGT 1337 

20 



2) INFORMATION FOR SEQ ID NO: 1216 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1216 
35 GGTAAGACTG GTACATCAAA CTA 23 



40 



2) INFORMATION FOR SEQ ID NO: 1217 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1217 

50 

CAAATGCCAT TTCAAGTAAC ACAAC 25 



55 2) INFORMATION FOR SEQ ID NO: 1218 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
60 (C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1218 
CAAACGCCAT TTCAAGTAAC ACAAC 
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25 



10 



L5 



20 



2) INFORMATION FOR SEQ ID NO: 1219 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1219 
CAAATGCTAT TTCAAGTAAT ACAAC 



25 



25 



30 



35 



40 



2) INFORMATION FOR SEQ ID NO: 1220 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1220 
CAAACGCCAT TTCAAGTAAT ACGAC 



25 



45 



50 



2 ) INFORMATION FOR SEQ ID NO: 1221 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1221 



55 GAYACICCIG GICAYGTIGA YTT 



23 



60 



2) INFORMATION FOR SEQ ID NO: 1222 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

<ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1222 
ATYGAYACIC CIGGICAYGT IGAYTT 



2) INFORMATION FOR SEQ ID NO: 1223 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1223 
AYITCIARRT GIARYTCRCC CATICC 



2) INFORMATION FOR SEQ ID NO: 1224 

(i) SEQUENCE CHARACTERISTICS: 

(A) .LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1224 
CCIGYIHTIY TIGARCCIAT IATG 



2 ) INFORMATION FOR SEQ ID NO: 1225 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1225 
TAICCRAACA TYTCISMIAR IGGIAC 
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10 



20 



2) INFORMATION FOR SEQ ID NO: 1226 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 0 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1226 
15 GTIRMRTAIC CRAACATYTC 20 



2) INFORMATION FOR SEQ ID NO: 1227 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1227 

30 

GTICCIYTIK CIGARATGTT YGGITA 26 



35 2) INFORMATION FOR SEQ ID NO: 1228 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 29 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1228 

GTICCIYTIK CIGARATGTT YGGITAYGC 2 9 



50 2) INFORMATION FOR SEQ ID NO: 1229 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
55 (C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1229 
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TCCATYTGIG CIGCICCIGT IATCAT 26 



5 2) INFORMATION FOR SEQ ID NO: 1230 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2145 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(D) ACCESSION NUMBER: X00415 



(xiii) SEQUENCE DESCRIPTION: SEQ ID NO: 123 0 



20 



TGAACGCCTA AAAGATAAAC GAGGAAACAA ATGGCTCGTA CAACACCCAT 50 

CGCACGCTAC CGTAACATCG GTATCAGTGC GCACATCGAC GCCGGTAAAA 100 

CCACTACTAC CGAACGTATT CTGTTCTACA CCGGTGTAAA CCATAAAATC 150 

GGTGAAGTTC ATGACGGCGC TGCAACCATG GACTGGATGG AGCAGGAGCA 2 00 

25 GGAACGTGGT ATTACCATCA CTTCCGCTGC GACTACTGCA TTCTGGTCTG 250 

GTATGGCTAA GCAGTATGAG CCGCATCGCA TCAACATCAT CGACACCCCG 3 00 

GGGCACGTTG ACTTCACAAT CGAAGTAGAA CGTTCCATGC GTGTTCTCGA 350 

TGGTGCGGTA ATGGTTTACT GCGCAGTTGG TGGTGTTCAG CCGCAGTCTG 4 00 

AAACCGTATG GCGTCAGGCA AACAAATATA AAGTTCCGCG CATTGCGTTC 4 50 

30 GTTAACAAAA TGGACCGCAT GGGTGCGAAC TTCCTGAAAG TTGTTAACCA 500 

GATCAAAACC CGTCTGGGCG CGAACCCGGT TCCGCTGCAG CTGGCGATTG 550 

GTGCTGAAGA ACATTTCACC GGTGTTGTTG ACCTGGTGAA AATGAAAGCT 600 

ATCAACTGGA ACGACGCTGA CCAGGGCGTA ACCTTCGAAT ACGAAGATAT 650 

CCCGGCAGAC ATGGTTGAAC TGGCTAACGA ATGGCACCAG AACCTGATCG 7 00 

35 AATCCGCAGC TGAAGCTTCT GAAGAGCTGA TGGAAAAATA CCTGGGTGGT 750 

GAAGAACTGA CTGAAGCAGA AATCAAAGGT GCTCTGCGTC AGCGCGTTCT 800 

GAACAACGAA ATCATCCTGG TAACCTGTGG TTCTGCGTTC AAGAACAAAG 850 

GTGTTCAGGC GATGCTGGAT GCGGTAATTG ATTACCTGCC ATCCCCGGTT 900 

GACGTACCTG CGATCAACGG TATCCTGGAC GACGGTAAAG ACACTCCGGC 950 

40 TGAACGTCAC GCAAGTGATG ACGAGCCGTT CTCTGCACTG GCGTTCAAAA 1000 

TCGCTACCGA CCCGTTTGTT GGTAACCTGA CCTTCTTCCG TGTTTACTCC 1050 

GGTGTGGTTA ACTCTGGTGA TACCGTACTG AACTCCGTGA AAGCTGCACG 1100 

TGAGCGTTTC GGTCGTATCG TTCAGATGCA CGCTAACAAA CGTGAAGAGA 1150 

TCAAAGAAGT TCGCGCGGGC GACATCGCTG CTGCTATCGG TCTGAAAGAC 1200 

45 GTAACCACTG GTGACACCCT GTGTGACCCG GATGCGCCGA TCATTCTGGA 1250 

ACGTATGGAA TTCCCTGAGC CGGTAATCTC CATCGCAGTT GAACCGAAAA 13 00 

CCAAAGCTGA CCAGGAAAAA ATGGGTCTGG CTCTGGGCCG TCTGGCTAAA 1350 

GAAGACCCGT CTTTCCGTGT ATGGACTGAC GAAGAATCTA ACCAGACCAT 14 00 

CATCGCGGGT ATGGGCGAAC TGCACCTCGA CATCATCGTT GACCGTATGA 1450 

50 AGCGTGAATT CAACGTTGAA GCGAACGTAG GTAAACCGCA GGTTGCTTAC 1500 

CGTGAAACTA TCCGCCAGAA AGTTACCGAT GTTGAAGGTA AACACGCGAA 1550 

ACAGTCTGGT GGTCGTGGTC AGTATGGTCA TGTTGTTATC GACATGTACC 1600 

CGCTGGAGCC GGGTTCAAAC CCGAAAGGCT ACGAGTTCAT CAACGACATT 1650 

AAAGGTGGTG TAATCCCTGG CGAATACATC CCGGCCGTTG ATAAAGGTAT 1700 

55 CCAGGAACAG CTGAAAGCAG GTCCGCTGGC AGGCTACCCG GTAGTAGACA 1750 

TGGGTATTCG TCTGCACTTC GGTTCTTACC ATGACGTTGA CTCCTCTGAA 1800 

CTGGCGTTTA AACTGGCTGC TTCTATCGCC TTTAAAGAAG GCTTTAAGAA 1850 

AGCGAAACCA GTTCTGCTTG AGCCGATCAT GAAGGTTGAA GTAGAAACTC 1900 

CGGAAGAGAA CACCGGTGAC GTTATCGGTG ACTTGAGCCG TCGTCGTGGT 1950 

60 ATGCTCAAAG GTCAGGAATC TGAAGTTACT GGCGTTAAGA TCCACGCTGA 2000 
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AGTACCGCTG TCTGAAATGT TCGGATACGC 
CCAAAGGTCG TGCATCATAC ACTATGGAAT 
CCGAGTAACG TTGCTCAGGC CGTAATTGAA 
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AACTCAGCTG CGTTCTCTGA 2050 
TCCTGAAGTA TGATGAAGCG 2100 
GCCCGTGGTA AATAA 2145 



2) INFORMATION FOR SEQ ID NO: 1231 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1231 
GCGAGCCCGA AGATAAAAAA GAACCTCTGC TGCTCGC 



2) INFORMATION FOR SEQ ID NO: 1232 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1232 
GGAGCCGCGC GATTTTATAA ATGAATGTTG ATAACCGGCT CC 



2) INFORMATION FOR SEQ ID NO: 1233 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1233 
GCGAGCGTTA CTGGTGTAGA AATGTTCCGG CTCGC 



2) INFORMATION FOR SEQ ID NO: 1234 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1234 
ACTAAATAAA CGCTCATTCG 20 



2) INFORMATION FOR SEQ ID NO: 1235 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3 8 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1235 
GCGAGCCGAA GTTGAAGTTG TTGGTATTGC TGGCTCGC 38 



2 ) INFORMATION FOR SEQ ID NO: 1236 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 34 bases 

(B) TYPE : Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1236 
GCGAGCCGTG GTGAAGTTCG CGTTGGTGGC TCGC 34 

40 

2) INFORMATION FOR SEQ ID NO: 1237 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1237 
55 GCGAGCCGCG AAATCGAAGT TGCTGTATTA GGGCTCGC 38 



2) INFORMATION FOR SEQ ID NO: 1238 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1238 
GCGAGCGGCG TTAATTTTGG CACCGAAGAA GAGCTCGC 



2) INFORMATION FOR SEQ ID NO: 1239 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123 9 
GCGAGCGCAG ACCTTTCAGC AGAGGAGGCT CGC 



2) INFORMATION FOR SEQ ID NO: 1240 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124 0 
GCGAGCCGGC AAGACAATAT GACAGCAAAA TCGCTCGC 



2) INFORMATION FOR SEQ ID NO: 1241 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1241 
GCGAGCGGGG AACGAGGATG ATTTGATTGG CTCGC 
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2) INFORMATION FOR SEQ ID NO: 1242 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 600 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(E) STRAIN: BM4147-1 

(F) ACCESSION NUMBER: U3 9790 

(xiv) SEQUENCE DESCRIPTION: SEQ ID NO: 1242 

TTCTTAGAGA CATTGAATAT GCCTTATGTC GGCGCAGGCG TATTGACCAG 50 

TGCATGTGCC ATGGATAAAA TCATGACCAA GTATATTTTA CAAGCTGCTG 100 

GTGTGCCGCA AGTTCCTTAT GTACCAGTAC TTAAGAATCA ATGGAAAGAA 150 

AATCCTAAAA AAGTATTTGA TCAATGTGAA GGTTCTTTGC TTTATCCGAT 200 

GTTTGTCAAA CCTGCGAATA TGGGTTCTAG TGTCGGCATT ACAAAGGCAG 250 

AAAACCGAGA AGAGCTGCAA AATGCTTTAG CAACAGCCTA TCAGTATGAT 300 

TCTCGAGCAA TCGTTGAACA AGGAATTGAA GCGCGCGAAA TCGAAGTTGC 350 

TGTATTAGGA AATGAAGATG TTCGGACGAC TTTGCCTGGC GAAGTCGTAA 400 

AAGACGTAGC ATTCTATGAT TATGAAGCCA AATATATCAA TAATAAAATC 450 

GAAATGCAGA TTCCAGCCGA AGTGCCGGAA GAAGTTTATC AAAAAGCGCA 500 

AGAGTACGCG AAGTTAGCTT ACACGATGTT AGGTGGAAGC GGATTGAGCC 550 

GGTGCGATTT CTTTTTGACA AATAAAAATG AATTATTCCT GAATGAATTA 600 



2 ) INFORMATION FOR SEQ ID NO: 1243 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2275 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecal is 
(C) ACCESSION NUMBER: M38386 

(xv) SEQUENCE DESCRIPTION: SEQ ID NO: 1243 

GGTACCAAAG AAAAAAACGA ACGCCACAAC CAACAGCCTC TAAAGCAACA 50 

CCTGCTTCTG AAATTGAGGG AGATTTAGCA AATGTCAATG AGATTCTTTT 100 

GGTTCACGAT GATCGTGTCG GGTCAGCAAC GATGGGAATG AAAGTCTTAG 150 

AAGAAATTTT AGATAAAGAG AAAATTTCAA TGCCGATTCG AAAAATTAAT 200 

ATTAATGAAT TAACTCAACA AACACAGGCT TTAATTGTCA CAAAAGCTGA 250 

ACTAACGGAA CAAGCACGTA AAAAAGCACC GAAAGCGACA CACTTATCAG 300 

TAAAAAGTTA TGGTTAATCC CCAAAAATAT GAAACAGTGG GTTTCGCTCT 350 

TAAAAGAAAG TGCCTAGAGA GGAAGAAAAC AATGGAAAAT CTTACGAATA 400 

TTTCAATTGA ATTAAATCAA CAGTTTAATA CAAAAGAAGA AGCTATTCGC 450 

TTTTCCGGCC AGAAACTAGT CGAGGCAGGC TGTGTTGAGC CCGCTTATAT 50 0 
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CGAAGCAATG ATTGAAAGAG ACCAATTGCT ATCTGCCCAT ATGGGGAATT 550 

TTATTGCCAT TCCTCATGGA ACAGAAGAAG CCAAAAAATT AGTGAAAAAA 600 

TCAGGAATCT GTGTAGTGCA AGTCCCAGAG GGCGTTAATT TTGGCACCGA 650 

AGAAGATGAA AAAATTGCTA CCGTATTATT TGGGATTGCC GGAGTCGGTG 700 

5 AAGAACATTT GCAATTAGTC CAACAAATTG CACTTTATTG TAGTGATATG 750 

GATAACGTGG TGCAACTTGC CGATGCATTA AGTAAAGAAG AAATAACAGA 800 

AAATTTAGCC ATTGCTTAAA GGAGAGAATA AGAATGAACG CAGTACATTT 850 

TGGAGCAGGA AATATTGGAC GCGGCTTTAT TGGCGAAATT TTAGCTAAAA 900 

CGGGTTTCAT ATTACCGTTT GTGGATGTTA ATGGAAACCA TCATCAAGCG 950 

10 TTAAAAGAAC GTAAAAGTTA TACAATTGAA TTGGCCGATG CCTCACATCA 1000 

ACAAATTAAC GTTGAAAATG TGACCGGGTT AAATAACATG ACAGAACCAG 1050 

AAAAAGTAGT AGAAGCAATT GCGGAAGCCG ATTTAGTCAC GACGGCAATT 1100 

GGTCCTAATA TTTTACCAAG AATTGCTGAA TTAATTGCTC AAGGAATTGA 1150 

TGCACGTGCC GAAGCAAATT GTCAAAACGG CCCGCTGGAT ATTATCGCTT 1200 

15 GTGAAAATAT GATTGGTGGT TCAACCTTTT TAGCAGAAGA AGTGGCCATA 12 50 

ATATTTGAAA AACCCAGCTT ATCTGAACAA TGGATTGGTT TTCCTGATGC 1300 

GGCAGTTGAT CGGATTGTTC CATTACAAAA ACATAAAGAT CCACTTTTTG 1350 

TTCAAGTTGA GCCTTTTTGT GAATGGGTCA TTGATGATAC CAACCGAAAA 1400 

GCCAAAGAGA TTCAGTTAGA AGGCGTCATT ACTTGTCGAT TAGAGCCGTA 1450 

20 TATTGAACGA AAATTATTTA GTGTAACCAG TGGCCATGCT ACAGTTGCCT 1500 

ATACAGGGGC GTTGTTAGGC TATCAAACCA TTGACGAAGC GATGCAGGAC 1550 

GCCTTAGTGG TAGCGCAACT CAAATCAGTT TTGCAGGAAA CCGGTAAACT 1600 

TTTAGTGGCC AAATGGAATT TTGATGAACA AGAACATGCA GCCTATATTG 1650 

AAAAAATTAT CAACCGTTTC CAAAATAAAT ATATTTCAGA TGCTATTACA 1700 

25 CGTGTAGCAC GGACACCAAT CAGAAAATTA GGTGCGCAAG AACGGTTTAT 1750 

TCGACCAATC CGTGAATTAC AGGAACGCAA TCTAGTGTCG GCCGCATTTA 18 00 

TAGCAATGAT TGGTATTGTC TTTAATTATC ATGATCCAGA AGATGAACAA 1850 

AGCCGTCAAT TACAGGAAAT GCTTGACCAA GAAAGTGTTG ATACAGTGGA 1900 

TCGCTGAAGT AACGGGCATT GAAGATCCAG AAACGGTTAA AAATATTAAA 1950 

30 CAAAACGTAG AACTGCTATG CGCGACCACA AGTAGCATAA TTAACAAAAT 2 000 

CCTTCTACCA AGATACTTCA ' CATTTCTTAA TTAAAGAAAA AACAACCGCG 2 050 

CCTCACCTGA GCCGACCCCC AAAAGTTAGA CCTAGAAATC TAACTTTTGG 2100 

AGGTTTTTTT GTATGGCAAA ATACAGTTTT GAAATTTAAA CTTAAACTTG 2150 

TTCATGACTA CTTATATGGT CAAGGAGGTC TAAGGTTTCT CGCAAAGAAG 22 00 

35 TATGGGTTTA AAGATAGTCT CAAATAAGCA AATGGATAAA TGCCTATAAA 2250 

GAACTTGGTG AAGAAGGGGG GATCC 2275 



40 2) INFORMATION FOR SEQ ID NO: 1244 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 442 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus subsp. aureus 

(B) STRAIN: ATCC 25923 



55 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1244 



GATCAATCTT TGTCGGTACA CGATATTCTT CACGACTAAA TAAACGCTCA 50 

TTCGCGATTT TATAAATGAA TGTTGATAAC AATGTTGTAT TATCTACTGA 100 

AATCTCATTA CGTTGCATCG GAAACATTGT GTTCTGTATG TAAAAGCCGT 150 

CTTGATAATC TTTAGTAGTA CCGAAGCTGG TCATACGAGA GTTATATTTT 200 

60 CCAGCCAAAA CGATATTTTT ATAATCATTA CGTGAAAAAG GTTTCCCTTC 250 
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ATTATCACAC AAATATTTTA GCTTTTCAGT TTCTATATCA ACTGTAGCTT 300 

CTTTATCCAT ACGTTGAATA ATTGTACGAT TCTGACGCAC CATCTTTTGC 350 

ACACCTTTAA TGTTATTTGT TTTAAAAGCA TGAATAAGTT TTTCAACACA 400 

ACGATGTGAA TCTTCTAAGA AGTCACCGTA AAATGAAGGA TC 442 
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2) INFORMATION FOR SEQ ID NO: 1245 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 845 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9444 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1245 

GATGGCGGAA AGCTACCAGA AATCTACAAC GCCCTTACGG TAAAACAGAG 50 

CAACGAAAAC GGAACAAGCA- TTAACTTAAC ATTTGAAGTT GCACTTCATT 100 

TAGGTGATGA CACAGTTCGT ACAGTTGCAA TGTCTTCCAC AGATGGACTT 150 

GTTCGTGGCA CAGAAGTAGA AGATACTGGT AAAGCAATCT CTGTACCAGT 200 

TGGTGATGCA ACACTTGGTC GTGTATTTAA CGTATTAGGT GATGCAATTG 250 

ACTTAGATGG TGAGGTTCCT GCGGATGTAC GTCGTGATCC AATTCACCGT 300 

CAAGCACCTG CATTCGAAGA ATTATCTACT AAAGTAGAAA TTCTTGAAAC 350 

TGGTATTAAA GTAGTAGACT TACTTGCTCC TTACATTAAG GGTGGTAAGA 400 

TCGGTCTATT CGGTGGTGCC GGTGTAGGTA AAACGGTATT AATTCAGGAA 450 

TTAATCAATA ACATCGCACA AGAACACGGT GGTATCTCTG TATTCGCTGG 500 

TGTAGGT.GAG CGTACTCGTG AGGGTAATGA CTTATACCAC GAAATGAGCG 550 

ATTCTGGCGT AATTAAGAAA ACTGCGATGG TATTCGGACA AATGAACGAG 600 

CCACCTGGAG CACGTCAACG TGTTGCGTTA ACAGGTTTAA CAATGGCTGA 650 

GCATTTCCGT GATGAGCAAG GACAAGATGT ACTTCTGTTC ATCGATAATA 700 

TCTTCCGTTT CACGCAAGCA GGTTCTGAAG TATCTGCCCT TCTTGGCCGT 750 

ATGCCATCTG CGGTAGGTTA CCAACCAACA CTTGCAACAG AAATGGGTCA 800 

ATTACAAGAG CGTATTACAT CTACAAATAA AGGGTCTATC ACGTC 845 



2) INFORMATION FOR SEQ ID NO: 1246 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 656 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: ATCC 11986 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1246 

TGCACTTCAT TTAGGTGATG ACACAGTTCG TACAGTTGCA ATGTCTTCCA 50 
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CAGATGGACT 
TCTGTACCAG 
TGATGCAATT 
CAATTCACCG 
ATTCTTGAAA 
GGGTGGTAAG 
TAATTCAGGA 
GTATTCGCTG 
CGAAATGAGC 
AAATGAACGA 
ACAATGGCTG 
CATCGATAAC 
TTCTTG 



TGTTCGTGGC 
TTGGTGATGT 
GACTTAGATG 
TCAAGCGCCT 
CTGGTATTAA 
ATTGGTCTAT 
ATTAATTAAT 
GTGTAGGTGA 
GATTCTGGCG 
GCCACCTGGA 
AACATTTCCG 
ATCTTCCGTT 



ACAGAAGTAG 
AACACTTGGT 
GTGATGTTCC 
GCATTCGAAG 
AGTAGTAGAC 
TCGGTGGTGC 
AACATCGCAC 
GCGTACTCGT 
TAATTAAGAA 
GCACGTCAAC 
TGATGAGCAA 
TCACGCAAGC 



AAGATACTGG TAAAGCAATC 
CGTGTATTTA ACGTATTAGG 
TGCGGATGTA CGTCGTGATC 
AGTTATCTAC TAAAGTAGAA 
TTACTTGCTC CTTACATTAA 
CGGCGTAGGT AAAACAGTAT 
AAGAGCACGG TGGTATCTCT 
GAAGGTAACG ACTTATACCA 
AACTGCGATG GTATTCGGAC 
GTGTTGCATT AACAGGTTTA 
GGACAAGACG TACTATTGTT 
GGGTTCTGAA GTATCTGCCC 



100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
656 



10 



15 



20 



25 



30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1247 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 791 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE:. Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) - STRAIN : ATCC 10792 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:' 1247 

CGAAAACGGA AGTATTAACT TAACATTTGA AGTTGCACTT CATTTAGGTG 50 

ATGATACAGT TCGTACAGTT GCGATGTCTT CCACAGATGG ACTTGTTCGT 100 

GGCACAGAAG TAGAAGATAC TGGTAAACCA ATCTCTGTAC CAGTTGGTGA 150 

TGTAACACTT GGTCGCGTAT TTAACGTATT AGGTGATGCA ATTGACTTAG 200 

ATGGTGAGGT TCCTGCAGAT GTACATCGTG ATCCAATTCA CCGTCAAGCA 250 

CCTGCATTCG AAGAATTATC TACTAAAGTA GAAATTCTTG AAACTGGTAT 300 

TAAAGTAGTA GACTTACTTG CTCCTTACAT TAAGGGTGGT AAGATCGGCC 350 

TATTCGGTGG TGCCGGCGTA GGTAAAACAG TATTAATTCA GGAATTAATT 400 

AACAACATCG CACAAGAGCA CGGTGGTATC TCTGTATTCG CTGGTGTAGG 450 

TGAGCGTACT CGTGAGGGTA ATGACTTATA CCACGAAATG AGCGATTCTG 500 

GCGTAATCAA GAAAACTGCG ATGGTATTCG GACAAATGAA CGAGCCACCT 550 

GGAGCACGTC AACGTGTTGC ATTAACAGGT TTAACAATGG CTGAGCATTT 600 

CCGTGATGAG CAAGGACAAG ACGTACTTCT GTTCATCGAT AACATCTTCC 650 

GTTTCACGCA AGCGGGTTCT GAAGTATCTG CCCTTCTTGG TCGTATGCCA 700 

TCTGCGGTAG GTTACCAACC AACACTTGCA ACAGAAATGG GTCAATTACA 750 

AGAGCGTATT ACATCTACAA ATAAAGGGTC TATCACGTCT A 7 91 • 



2) INFORMATION FOR SEQ ID NO: 1248 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 ( D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: BGSC 4AC1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1248 

15 ATCTACAATG CCCTTACGGT AAAACAAAGC AACGAAAACG GAAGCATGAA 50 

CTTAACATTT GAAGTTGCAC TTCATTTAGG TGATGATACA GTTCGTACAG 100 

TTGCGATGTC TTCCACAGAT GGACTTGTTC GTGGCACAGA AGTAGAAGAT 150 

ACTGGTAAAG CAATCTCTGT ACCAGTTGGT GATGCAACAC TTGGACGTGT 200 

ATTCAACGTA TTAGGTGATG CAATTGACTT AGATGGTGAA CTTCCTGCGG 250 

2 0 ATGTACACCG TGATCCAATT CACCGTCAAG CACCTGCATT CGAAGAATTA 300 

TCTACTAAAG TAGAAATTCT TGAAACTGGT ATTAAAGTAG TAGACTTACT 350 

TGCTCCTTAC ATTAAGGGTG GTAAGATCGG CCTATTCGGT GGTGCCGGCG 400 

TAGGTAAAAC AGTATTAATT CAGGAGTTAA TCAATAACAT CGCACAAGAG 4 50 

CACGGTGGTA TCTCTGTATT CGCTGGTGTA GGTGAGCGTA CTCGTGAGGG 500 

25 TAATGACTTA TACCACGAAA TGAGCGATTC TGGCGTAATC AAGAAAACTG 550 

CGATGGTATT CGGACAAATG AACGAGCCAC CTGGAGCACG TCAACGTGTT 600. 

GCATTAACAG GTTTAACAAT GGCTGAGCAT TTCCGTGATG AGCAAGGACA 650 

AGACGTACTT CTGTTCATCG ATAACATCTT CCGTTTCACG CAAGCGGGTT 700 

CTGAAGTATC TGCCCTTCTT GGTCGTATGC CATCTGCGGT AGGTTACCAA 7 50 

3 0 CCAACACTTG CAACAGAAAT GGGTCAATTA CAAGAGCGTA TTACATCTAC 800 

AAATAAAGGG TCTATCACGT CTATC 825 



35 2) INFORMATION FOR SEQ ID NO: 1249 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1236 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1249 

ATCTACAACG CCCTTACGGT AAAACAAAGC AACGAAAACG GAAGTATTAA 50 
CTTAACATTT GAAGTTGCAC TTCATTTAGG TGATGATACA GTTCGTACAG 100 
TTGCGATGTC TTCCACAGAT GGACTTGTTC GTGGCACAGA AGTAGAAGAT 150 

672 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



ACTGGTAAAC 
ATTTAACGTA 
ATGTACATCG 
TCTACTAAAG 
TGCTCCTTAC 
TAGGTAAAAC 
CACGGTGGTA 
TAATGACTTA 
CGATGGTATT 
GCATTAACAG 
AGACGTACTT 
CTGAAGTATC 
CCAACACTTG 



CAATCTCTGT 
TTAGGTGATG 
TGATCCAATT 
TAGAAATTCT 
ATTAAGGGTG 
AGTATTAATT 
TCTCTGTATT 
TACCACGAAA 
CGGACAAATG 
GTTTAACAAT 
CTGTTCATCG 
TGCCCTTCTT 
CAACAGAAAT 



ACCAGTTGGT 
CAATTGACTT 
CACCGTCAAG 
TGAAACTGGT 
GTAAGATCGG 
CAGGAATTAA 
CGCTGGTGTA 
TGAGCGATTC 
AACGAGCCAC 
GGCTGAGCAT 
ATAACATCTT 
GGTCGTATGC 
GGGTC 



GATGTAACAC 
AGATGGTGAG 
CACCTGCATT 
ATTAAAGTAG 
CCTATTCGGT 
TTAACAACAT 
GGTGAGCGTA 
TGGCGTAATC 
CTGGAGCACG 
TTCCGTGATG 
CCGTTTCACG 
CATCTGCGGT 



TTGGTCGCGT 
GTTCCTGCAG 
CGAAGAATTA 
TAGACTTACT 
GGTGCCGGCG 
CGCACAAGAG 
CTCGTGAGGG 
AAGAAAACTG 
TCAACGTGTT 
AGCAAGGACA 
CAAGCGGGTT 
AGGTTACCAA 



200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
775 



10 



15 



20 



25 



30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1250 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10204 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1250 

CCAGCAATCT ACAACGCCCT TACGGTAAAA CAAAGCAACG AAAAC GGAGC 50 

GAGCATCAAC TTAACATTTG AAGTTGCACT TCATTTAGGT GATGACACAG 100 

TTCGTACAGT TGCGATGTCT TCCACAGATG GACTTGTTCG TGGCACAGAA 150 

GTAGAAGATA CTGGTAAAGC AATCTCTGTA CCAGTTGGTG ATGTAACACT 200 

TGGTCGCGTA TTCAACGTAT TAGGTGATGC AATTGACTTA GATGGTGATG 250 

TTCCTGCGGA TGTACGTCGT GATCCAATTC ACCGTCAAGC ACCTGCATTC 300 

GAAGAACTAT CTACAAGAAT AGAAATTCTT GAAACTGGTA TTAAAGTAGT 350 

AGATTTACTT GCTCCTTACA TTAAGGGTGG TAAGATCGGT CTATTCGGTG 400 

GTGCCGGTGT AGGTAAAACG GTATTAATTC AGGAATTAAT TAACAACATC 450 

GCACAAGAGC ACGGTGGTAT CTCTGTATTC GCTGGTGTAG GTGAGCGTAC 500 

TCGTGAGGGT AATGACTTAT ACCACGAAAT GAGCGATTCT GGCGTAATTA 550 

AGAAAACTGC GATGGTATTT GGACAAATGA ACGAGCCACC TGGAGCACGT 600 

CAACGTGTTG CATTAACAGG TTTAACAATG GCTGAACATT TCCGTGATGA 650 

GCAAGGACAA GACGTACTAT TGTTCATCGA TAACATCTTC CGTTTCACGC 700 

AAGCAGGTTC TGAAGTATCT GCCCTTCTTG GTCGTATGCC ATCTGCGGTA 750 

GGTTACCAAC CAACACTTGC AACAGAAATG GGTCAATTAC AAGAGCGTAT 800 

TACATCTACA AATAAAGGGT CTATCACGTC TA 832 



2) INFORMATION FOR SEQ ID NO: 1251 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 802 bases 

(B) TYPE: Nucleic acid 
<C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1418 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1251 

AAATCTACAA CGCCCTTACG GTAAAACAAA GCAACGAAAA CGGAAGTATT 50 

AACTTAACAT TTGAAGTTGC ACTTCATTTA GGTGATGATA CAGTTCGTAC 100 

AGTTGCGATG TCTTCCACAG ATGGACTTGT TCGTGGCACA GAAGTAGAAG 150 

ATACTGGTAA ACCAATCTCT GTACCAGTTG GTGATGTAAC ACTTGGTCGC 200 

GTATTTAACG TATTAGGTGA TGCAATTGAC TTAGATGGTG AGGTTCCTGC 250 

AGATGTACAT CGTGATCCAA TTCACCGTCA AGCACCTGCA TTCGAAGAAT 300 

TATCTACTAA AGTAGAAATT CTTGAAACTG GTATTAAAGT AGTAGACTTA 350 

CTTGCTCCTT ACATTAAGGG TGGTAAGATC GGCCTATTCG GTGGTGCCGG 400 

CGTAGGTAAA ACAGTATTAA TTCAGGAATT AATTAACAAC ATCGCACAAG 450 

AGCACGGTGG TATCTCTGTA TTCGCTGGTG TAGGTGAGCG TACTCGTGAG 500 

GGTAATGACT TATACCACGA AATGAGCGAT TCTGGCGTAA TCAAGAAAAC 550 

TGCGATGGTA TTCGGACAAA TGAACGAGCC ACCTGGAGCA CGTCAACGTG 600 

TTGCATTAAC AGGTTTAACA ATGGCTGAGC ATTTCCGTGA TGAGCAAGGA 650 

CAAGACGTAC TTCTGTTCAT' CGATAACATC TTCCGTTTCA CGCAAGCGGG 700 

TTCTGAAGTA TCTGCCCTTC TTGGTCGTAT GCCATCTGCG GTAGGTTACC 750 

AACCAACACT TGCAACAGAA ATGGGTCAAT TACAAGAGCG TATTACATCT 800 

AC 802 



2) INFORMATION FOR SEQ ID NO: 1252 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1410 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1252 

AAATCTACAA CGCCCTTACG GTAAAACAAA GCAACGAAAA CGGAAGTATT 50 
AACTTAACAT TTGAAGTTGC ACTTCATTTA GGTGATGATA CAGTTCGTAC 100 
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ATACTGGTAA ACCAATCTCT GTACCAGTTG 
GTATTTAACG TATTAGGTGA TGCAATTGAC 
AGATGTACAT CGTGATCCAA TTCACCGTCA 
5 TATCTACTAA AGTAGAAATT CTTGAAACTG 
CTTGCTCCTT ACATTAAGGG TGGTAAGATC 
CGTAGGTAAA ACAGTATTAA TTCAGGAATT 
AGCACGGTGG TATCTCTGTA TTCGCTGGTG 
GGTAATGACT TATACCACGA AATGAGCGAT 

10 TGCGATGGTA TTCGGACAAA TGAACGAGCC 
TTGCATTAAC AGGCTTAACA ATGGCTGAGC 
CAAGACGTAC TTCTGTTCAT CGATAACATC 
TTCTGAAGTA TCTGCCCTTC TTGGTCGTAT 
AACCAACACT TGCAACAGAA ATGGGTCAAT 

15 ACAAATAAAG GGTCTATCAC GTC 



PCT/CAOO/01150 

TCGTGGCACA GAAGTAGAAG 150 

GTGATGTAAC ACTTGGTCGC 200 

TTAGATGGTG AGGTTCCTGC 250 

AGCACCTGCA TTCGAAGAAT 300 

GTATTAAAGT AGTAGACTTA 350 

GGCCTATTCG GTGGTGCCGG 400 

AATTAACAAC ATCGCACAAG 4 50 ' 

TAGGTGAGCG TACTCGTGAG 500 

TCTGGCGTAA TCAAGAAAAC 550 

ACCTGGAGCA CGTCAACGTG 600 

ATTTCCGTGA TGAGCAAGGA 650 

TTCCGTTTCA CGCAAGCGGG 700 

GCCATCTGCG GTAGGTTACC 7 50 

TACAAGAGCG TATTACATCT 800 

823 



2) INFORMATION FOR SEQ ID NO: 1253 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE : 
30 (A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 13472 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1253 

35 CCAGAAATCT ACAATGCCCT TACGGTAAAA CAAAGCAACG AAAACGGAAG 50 

CATGAACTTA ACATTTGAAG TTGCACTTCA TTTAGGTGAT GATACAGTTC 100 

GTACAGTTGC GATGTCTTCC ACAGATGGAC TTGTTCGTGG CACAGAAGTA 150 

GAAGATACTG GTAAAGCAAT CTCTGTACCA GTTGGTGATG CAACACTTGG 200 

ACGTGTATTC AACGTATTAG GTGATGCAAT TGACTTAGAT GGTGAACTTC 250 

40 CTGCGGATGT ACACCGTGAT CCAATTCACC GTCAAGCACC TGCATTCGAA 300 

GAATTATCTA CTAAAGTAGA AATTCTTGAA ACTGGTATTA AAGTAGTAGA 350 

CTTACTTGCT CCTTACATTA AGGGTGGTAA GATCGGCCTA TTCGGTGGTG 400 

CCGGCGTAGG TAAAACAGTA TTAATTCAGG AATTAATCAA TAACATCGCA 4 50 

CAAGAGCATG GTGGTATCTC TGTATTCGCT GGTGTAGGTG AGCGTACTCG 500 

45 TGAGGGTAAT GACTTATACC ACGAAATGAG CGATTCTGGC GTAATCAAGA 550 

AAACTGCGAT GGTATTCGGA CAAATGAACG AGCCACCTGG AGCACGTCAA 600 

CGTGTTGCAT TAACAGGTTT AACAATGGCT GAGCATTTCC GTGATGAGCA 650 

AGGACAAGAC GTACTTCTGT TCATCGATAA CATCTTCCGT TTCACGCAAG 700 

CGGGTTCTGA AGTATCTGCC CTTCTTGGTC GTATGCCATC TGCGGTAGGT 750 

50 TACCAACCGA CACTTGCAAC AGAAATGGGT CAATTACAAG AGCGTATT 798 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 767 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 7064 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1254 

CATTTGAAGT TGCACTTCAT TTAGGTGATG ACACAGTTCG TACAGTTGCA 50 

ATGTCTTCCA CAGATGGACT TGTTCGTGGC ACAGAAGTAG AAGATACTGG 100 

TAAAGCAATC TCTGTACCAG TTGGTGATGC AACACTTGGT CGTGTATTTA 150 

20 ACGTATTAGG TGATGCAATT GACTTAGATG GTGAGGTTCC TGCGGATGTA 200 

CGTCGTGATC CAATTCACCG TCAAGCACCT GCATTCGAAG AATTATCTAC 250 

TAAAGTAGAA ATTCTTGAAA CTGGTATTAA AGTAGTAGAC TTACTTGCTC 300 

CTTACATTAA GGGTGGTAAG ATCGGTCTAT TCGGTGGTGC CGGTGTAGGT 350 

AAAACGGTAT TAATTCAGGA ATTAATCAAT AACATCGCAC AAGAACACGG 400 

25 TGGTATCTCT GTATTCGCTG GTGTAGGTGA GCGTACTCGT GAGGGTAATG 4 50 

ACTTATACCA CGAAATGAGC GATTCTGGCG TAATTAAGAA AACTGCGATG 500 

GTATTCGGAC AAATGAACGA GCCACCTGGA GCACGTCAAC GTGTTGCGTT 550 

AACAGGTTTA ACAATGGCTG AGCATTTCCG TGATGAGCAA GGACAAGACG 600 

TACTTCTGTT CATCGATAAT ATCTTCCGTT TCACGCAAGC AGGTTCTGAA 650 

30 GTATCTGCCC TTCTTGGCCG TATGCCATCT GCGGTAGGTT ACCAACCAAC 700 

ACTTGCAACA GAAATGGGTC AATTACAAGA GCGTATTACA TCTACAAATA 750 

AAGGGTCTAT CACGTCT 767 



35 



45 



2) INFORMATION FOR SEQ ID NO: 1255 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1174 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: C-14 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1255 

GAAATGCGTG AATCATTTTT AGATTATGCG ATGAGTGTTA TCGTTGCTCG 50 
TGCATTGCCA GATGTTCGTG ACGGTTTAAA ACCAGTACAT CGTCGTATAC 100 

676 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



TATATGGATT AAATGAACAA GGTATGACAC CGGATAAATC ATATAAAAAA 150 

TCAGCACGTA TCGTTGGTGA CGTAATGGGT AAATATCACC CTCATGGTGA 200 

CTTATCTATT TATGAAGCAA TGGTACGTAT GGCTCAAGAT TTCAGTTATC 250 

GTTATCCGCT TGTTGATGGC CAAGGTAACT TTGGTTCAAT GGATGGAGAT 300 

5 GGCGCAGCAG CAATGCGTTA TACTGAAGCG CGTATGACTA AAATCACACT 350 

TGAACTGTTA CGTGATATTA ATAAAGATAC AATAGATTTT ATCGATAACT 400 

ATGATGGTAA TGAAAGAGAG CCGTCAGTCT TACCTGCTCG ATTCCCTAAC 450 

TTATTAGCCA ATGGTGCATC AGGTATCGCG GTAGGTATGG CAACGAATAT 500 

TCCACCACAT AACTTAACAG AATTAATCAA TGGTGTACTT AGCTTAAGTA 550 

10 AGAACCCTGA TATTTCAATT GCTGAGTTAA TGGAGGATAT TGAAGGTCCT 600 

GATTTCCCAA CTGCTGGACT TATTTTAGGT AAGAGTGGTA TTAGACGTGC 650 

ATATGAAACA GGTCGTGGTT CAATTCAAAT GCGTTCTCGT GCAGTTATTG 700 

AAGAACGTGG AGNCGGACGT CAACGTATTG TTGTCACTGA AATTCCTTTC 750 

CAAGTGAATA AGGCTCGTAT GATTGAAAAA ATTGCAGAGC TCGTTCGTGA 800 

15 CAAGAAAATT GACGGTATCA CTGATTTACG TGATGAAACA AGTTTACGTA 850 

CTGGTGTGCG TGTCGTTATT GATGTGCGTA AGGATGCAAA TGCTAGTGTC 900 

ATTTTAAATA ACTTATACAA ACAAACACCT CTTCAAACAT CATTTGGTGT 950 

GAATATGATT GCACTTGTAA ATGGTAGACC GAAGCTTATT AATTTAAAAG 1000 

AAGCGTTGGT ACATTATTTA GAGCATCAAA AGACAGTTGT TAGAAGACGT 1050 

20 ACGCAATACA ACTTACGTAA AGCTAAAGAT CGTGCCCACA TTTTAGAAGG 1100 

ATTACGTATC GCACTTGACC ATATCGATGA AATTATTTCA ACGATTCGTG 1150 

AGTCAGATAC AGATAAAGTT GCAA 1174 



25 



35 



2) INFORMATION FOR SEQ ID NO: 1256 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 780 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10209 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1256 

ATCTACAACG CCCTTACGGT AAAACAAAGC AACGAAAACG GAGCAAGCAT 50 

TAACTTAACA TTTGAAGTTG CACTTCATTT AGGTGATGAC ACAGTTCGTA 100 

CAGTTGCAAT GTCTTCCACA GATGGACTTG TTCGTGGCAC AGAAGTAGAA 150 

45 GATACTGGTA AAGCAATCTC TGTACCAGTT GGTGATGTAA CACTTGGTCG 200 

TGTATTTAAC GTATTAGGTG ATGCAATTGA CTTAGATGGA GATGTTCCTG 250 

CGGATGTACG TCGTGATCCA ATTCACCGTC AAGCGCCTGC ATTCGAAGAG 300 

TTATCTACTA AAGTAGAAAT TCTTGAAACT GGTATTAAAG TAGTAGACTT 350 

ACTTGCTCCT TACATTAAGG GTGGTAAGAT CGGTCTATTC GGTGGTGCCG 4 00 

50 GTGTAGGTAA AACAGTATTA ATTCAGGAAT TAATTAACAA CATCGCACAA 450 

GAGCACGGTG GTATCTCTGT ATTCGCTGGT GTAGGTGAGC GTACTCGTGA 500 

AGGTAACGAC TTATACCACG AAATGAGCGA TTCTGGCGTA ATTAAGAAAA 550 

CTGCGATGGT ATTCGGACAA ATGAACGAGC CACCTGGAGC ACGTCAACGT 600 
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GTTGCATTAA CAGGTTTAAC AATGGCTGAA CATTTCCGTG ATGAGCAAGG 
GCAAGACGTA CTATTGTTCA TCGATAACAT CTTCCGTTTC ACGCAAGCGG 
GTTCTGAAGT ATCTGCCCTT CTTGGTCGTA TGCCATCTGC GGTAGGTTAC 
CAGCCAACAC TTGCAACAGA AATGGGTCAA 



650 
700 
750 
780 
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2) INFORMATION FOR SEQ ID NO: 1257 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 7700 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1257 

CTACCAGAAA TCTACAACGC CCTTACGGTA AAACAGAGCA ACGAAAACGG 50 

AACAAGCATT AACTTAACAT TTGAAGTTGC ACTTCATTTA GGTGATGACA 100 

CAGTTCGTAC AGTTGCAATG TCTTCCACAG ATGGACTTGT TCGTGGCACA 150 

GAAGTAGAAG ATACTGGTAA AGCAATCTCT GTACCAGTTG , GTGATGCAAC 200 

ACTTGGTCGT GTATTTAACG TATTAGGTGA TGCAATTGAC TTAGATGGTG 250 

AGGTTCCTGC GGATGTACGT CGTGATCCAA TTCACCGTCA AGCACCTGCA 300 

TTCGAAGAAT TATCTACTAA AGTAGAAATT CTTGAAACTG GTATTAAAGT 350 

AGTAGACTTA CTTGCTCCTT ACATTAAGGG TGGTAAGATC GGTCTATTCG 400 

GTGGTGCCGG TGTAGGTAAA ACGGTATTAA TTCAGGAATT AATCAATAAC 4 50 

ATCGCACAAG AACACGGTGG TATCTCTGTA TTCGCTGGTG TAGGTGAGCG 500 

TACTCGTGAG GGTAATGACT TATACCACGA AATGAGCGAT TCTGGCGTAA 550 

TTAAGAAAAC TGCGATGGTA TTCGGACAAA TGAACGAGCC ACCTGGAGCA 600 

CGTCAACGTG TTGCGTTAAC AGGTTTAACA ATGGCTGAGC ATTTCCGTGA 650 

TGAGCAAGGA CAAGATGTAC TTCTGTTCAT CGATAATATC TTCCGTTTCA 700 

CGCAAGCAGG TTCTGAAGTA TCTGCCCTTC TTGGCCGTAT GCCATCTGCG 750 

GTAGGTTACC AACCAACACT TGCAACAGAA ATGGGTCAAT TACAAGAGCG 800 

TATTACATCT ACAAATA 817 



2) INFORMATION FOR SEQ ID NO: 1258 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 829 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1404 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1258 

CCAGAAATCT ACAACGCCCT TACGGTAAAA CAAAGCAACG AAAACGGAAG 50 

TATTAACTTA ACATTTGAAG TTGCACTTCA TTTAGGTGAT GATACAGTTC 100 

GTACAGTTGC GATGTCTTCC ACAGATGGAC TTGTTCGTGG CACAGAAGTA 150 

GAAGATACTG GTAAACCAAT CTCTGTACCA GTTGGTGATG TAACACTTGG 200 

TCGCGTATTT AACGTATTAG GTGATGCAAT TGACTTAGAT GGTGAGGTTC 250 

CTGCAGATGT ACATCGTGAT CCAATTCACC GTCAAGCACC TGCATTCGAA 300 

GAATTATCTA CTAAAGTAGA AATTCTTGAA ACTGGTATTA AAGTAGTAGA 350 

CTTACTTGCT CCTTACATTA AGGGTGGTAA GATCGGCCTA TTCGGTGGTG 4 00 

CCGGCGTAGG TAAAACAGTA TTAATTCAGG AATTAATTAA CAACATCGCA 4 50 

CAAGAGCACG GTGGTATCTC TGTATTCGCT GGTGTAGGTG AGCGTACTCG 500 

TGAGGGTAAT GACTTATACC ACGAAATGAG CGATTCTGGC GTAATCAAGA 550 

AAACTGCGAT GGTATTCGGA CAAATGAACG AGCCACCTGG AGCACGTCAA 600 

CGTGTTGCAT TAACAGGTTT AACAATGGCT GAGCATTTCC GTGATGAGCA 650 

AGGACAAGAC GTACTTCTGT TCATCGATAA CATCTTCCGT TTCACGCAAG 700 

CGGGTTCTGA AGTATCTGCC CTTCTTGGTC GTATGCCATC TGCGGTAGGT 750 

TACCAACCAA CACTTGCAAC AGAAATGGGT CAATTACAAG AGCGTATTAC 800 

ATCTACAAAT AAAGGGTCTA TCACGTCTA 829 



2) INFORMATION FOR SEQ ID NO: 1259 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 bases' 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 15816 

(C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1259 

TGGCGGAAAG CTACCAGAAA TCTACAACGC CCTTACGGTA AAACAGAGCA 50 

ACGAAAACGG TGAACTTAAC TTAACATTTG AAGTTGCACT TCATTTAGGT 100 

GATGATACAG TTCGTACAGT TGCGATGTCT TCCACAGATG GACTTGTTCG 150 

TGGCACAGAA GTAGAAGATA CTGGTAAAGC AATCTCTGTA CCAGTTGGTG 200 

ATGCAACACT TGGTCGCGTA TTTAACGTAT TAGGTGATGC TATTGACTTA 250 

GATGGTGAGG TTCCTGCGGA TGTACGTCGT GATCCAATTC ACCGTCAAGC 300 

ACCTGCATTC GAAGAATTAT CTACTAAAGT AGAAATTCTT GAAACTGGTA 350 

TTAAAGTAGT AGACTTACTT GCTCCTTACA TTAAGGGTGG TAAGATCGGC 4 00 

CTATTCGGTG GTGCCGGTGT AGGTAAAACA GTATTAATTC AGGAGTTAAT 4 50 

CAACAACATC GCACAAGAGC ACGGTGGTAT CTCTGTATTC GCTGGTGTAG 500 
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GTGAGCGTAC TCGTGAGGGT 
GGCGTAATTA AGAAAACTGC 
TGGAGCACGT CAACGTGTTG 
TCCGTGATGA GCAAGGACAA 
CGTTTCACGC AAGCAGGTTC 
ATCTGCGGTA GGTTACCAAC 
AAGAGCGTAT TACATCTACA 



AATGACTTAT ACCACGAAAT GAGCGATTCT 
GATGGTATTC GGACAAATGA ACGAGCCACC 
CATTAACAGG CTTAACAATG GCTGAATATT 
GACGTACTTC TGTTCATCGA TAATATCTTC 
TGAAGTATCT GCCCTTCTTG GCCGTATGCC 
CAACACTTGC AACAGAAATG GGTCAATTAC 
AATAAAGGGT CTATCACGTC TATC 



550 
600 
650 
700 
750 
800 
844 
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2) INFORMATION FOR SEQ ID NO: 1260 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 840 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 49064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1260 

AAGCTACCAG AAATCTACAA CGCCCTTACG GTAAAACAGA .GCAACGAAAA 50 

CGGAACAAGC ATTAACTTAA CATTTGAAGT TGCACTTCAT TTAGGTGATG 100 

ACACAGTTCG TACAGTTGCA ATGTCTTCCA CAGATGGACT TGTTCGTGGC 150 

ACAGAAGTAG AAGATACTGG TAAAGCAATC TCTGTACCAG TTGGTGATGC 200 

AACACTTGGT CGTGTATTTA ACGTATTAGG TGATGCAATT GACTTAGATG 250 

GTGAGGTTCC TGCGGATGTA CGCCGTGATC CAATTCACCG TCAAGCACCT 300 

GCATTCGAAG AATTATCTAC TAAAGTAGAA ATTCTTGAAA CTGGTATTAA 350 

AGTAGTAGAC TTACTTGCTC CTTACATTAA GGGTGGTAAG ATCGGTCTAT 400 

TCGGTGGTGC CGGTGTAGGT AAAACAGTAT TAATTCAGGA ATTAATCAAC 450 

AACATCGCAC AAGAACACGG TGGTATCTCT GTATTCGCTG GTGTAGGTGA 500 

GCGTACTCGT GAGGGTAATG ACTTATACCA CGAAATGAGC GATTCAGGCG 550 

TAATTAAGAA AACTGCGATG GTATTCGGAC AAATGAACGA GCCACCTGGA 600 

GCGCGTCAAC GTGTTGCGTT AACAGGTTTA ACAATGGCTG AGCATTTCCG 650 

TGATGAGCAA GGACAAGACG TTCTTCTGTT CATCGATAAT ATCTTCCGTT 700 

TCACGCAAGC AGGTTCTGAA GTATCTGCCC TTCTTGGTCG TATGCCATCT 750 

GCGGTAGGTT ACCAACCAAC ACTTGCAACA GAAATGGGTC AATTACAAGA 800 

GCGTATTACA TCTACAAATA AAGGGTCTAT CACGTCTATC 840 



2) INFORMATION FOR SEQ ID NO: 1261 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 839 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
5 (A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: BGSC 4AZ1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1261 

10 GCGGAAAGCT ACCAGAAATC TACAATGCCC TTACGGTAAA ACAAAGCAAC 50 

GAAAACGGAA GCATGAACTT AACATTTGAA GTTGCACTTC ATTTAGGTGA 100 

TGATACAGTT CGTACAGTTG CGATGTCTTC CACAGATGGA CTTGTTCGTG 150 

GCACAGAAGT AGAAGATACT GGTAAAGCAA TCTCTGTACC AGTTGGTGAT 200 

GCAACACTTG GACGTGTATT CAACGTATTA GGTGATGCAA TTGACTTAGA 250 

15 TGGTGAACTT CCTGCGGATG TACACCGTGA TCCAATTCAC CGTCAAGCAC 300 

CTGCATTCGA AGAATTATCT ACTAAAGTAG AAATTCTTGA AACTGGTATT 350 

AAAGTAGTAG ACTTACTTGC TCCTTACATT AAGGGTGGTA AGATCGGCCT 4 00 

ATTCGGTGGT GCCGGCGTAG GTAAAACAGT ATTAATTCAG GAGTTAATCA 450 

ATAACATCGC ACAAGAGCAC GGTGGTATCT CTGTATTCGC TGGTGTAGGT 500 

20 GAGCGTACTC GTGAGGGTAA TGACTTATAC CACGAAATGA GCGATTCTGG 550 

CGTAATCAAG AAAACTGCGA TGGTATTCGG ACAAATGAAC GAGCCACCTG 600 

GAGCACGTCA ACGTGTTGCA TTAACAGGTT TAACAATGGC TGAGCATTTC 650 

CGTGATGAGC AAGGACAAGA CGTACTTCTG TTCATCGATA ACATCTTCCG 700 

TTTCACGCAA GCGGGTTCTG AAGTATCTGC CCTTCTTGGT CGTATGCCAT 750 

25 CTGCGGTAGG TTACCAACCA ACACTTGCAA CAGAAATGGG TCAATTACAA 800 

GAGCGTATTA CATCTACAAA TAAAGGGTCT ATCACGTCT 839 



30 2) INFORMATION -FOR SEQ ID NO: 1262 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: BGSC 4H2 



45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1262 



AAGCTACCAG AAATCTACAA TGCCCTTACG GTAAAACAAA GCAACGAAAA 50 

CGGAAGCATG AACTTAACAT TTGAAGTTGC ACTTCATTTA GGTGATGATA 100 

CAGTTCGTAC AGTTGCGATG TCTTCCACAG ATGGACTTGT TCGTGGCACA 150 

GAAGTAGAAG ATACTGGTAA AGCAATCTCT GTACCAGTTG GTGATGCAAC 200 

50 ACTTGGACGT GTATTCAACG TATTAGGTGA TGCAATTGAC TTAGATGGTG 250 

AACTTCCTGC GGATGTACAC CGTGATCCAA TTCACCGTCA AGCACCTGCA 300 

TTCGAAGAAT TATCTACTAA AGTAGAAATT CTTGAAACTG GTATTAAAGT 350 

AGTAGACTTA CTTGCTCCTT ACATTAAGGG TGGTAAGATC GGCCTATTCG 400 
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GTGGTGCCGG 
ATCGCACAAG 
TACTCGTGAG 
TCAAGAAAAC 
CGTCAACGTG 
TGAGCAAGGA 
CGCAAGCGGG 
GTAGGTTACC 
TATTACATCT 



CGTAGGTAAA 
AGCACGGTGG 
GGTAATGACT 
TGCGATGGTA 
TTGCATTAAC 
CAAGACGTAC 
TTCTGAAGTA 
AACCAACACT 
ACAAATAAAG 



ACAGTATTAA TTCAGGAATT AATCAATAAC 
TATCTCTGTA TTCGCTGGTG TAGGTGAGCG 
TATACCACGA AATGAGCGAT TCTGGCGTAA 
TTCGGACAAA TGAACGAGCC ACCTGGAGCA 
AGGTTTAACA ATGGCTGAGC ATTTCCGTGA 
TTCTGTTCAT CGATAACATC TTCCGTTTCA 
TCTGCCCTTC TTGGTCGTAT GCCATCTGCG 
TGCAACAGAA ATGGGTCAAT TACAAGAGCG 
GGTCTATCAC GTC 



450 
500 
550 
600 
650 
700 
750 
800 
833 
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2) INFORMATION FOR SEQ ID NO: 1263 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: BGSC 4Q1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1263 

CGAAAACGGA AGCATGAACT TAACATTTGA AGTTGCACTT CATTTAGGTG 50 

ATGATACAGT TCGTACAGTT GCGATGTCTT CCACAGATGG ACTTGTTCGT 100 

GGCACAGAAG TAGAAGATAC TGGTAAAGCA ATTTCTGTAC CAGTTGGTGA 150 

TGTAACACTT GGACGTGTAT TCAACGTATT AGGTGATGCA ATTGACTTAG 200 

ATGGTGAACT TCCTGCGGAT GTACACCGTG ATCCAATTCA CCGTCAAGCA 250 

CCTGCATTCG AAGAATTATC TACTAAAGTA GAAATTCTTG AAACTGGTAT 300 

TAAAGTAGTA GACTTACTTG CTCCTTACAT TAAGGGTGGT AAGATCGGCC 350 

TATTCGGTGG TGCCGGTGTA GGTAAAACAG TATTAATTCA GGAATTAATT 400 

AACAACATCG CACAAGAGCA CGGTGGTATC TCTGTATTCG CTGGTGTAGG 450 

TGAGCGTACT CGTGAGGGTA ATGACTTATA CCACGAAATG AGCGATTCTG 500 

GCGTAATCAA GAAAACTGCG ATGGTATTCG GACAAATGAA CGAGCCACCT 550 

GGAGCACGTC AACGTGTTGC ATTAACAGGT TTAACAATGG CTGAGCATTT 600 

CCGTGATGAG CAAGGACAAG ACGTACTTCT GTTCATCGAT AACATCTTCC 650 

GTTTCACGCA AGCGGGTTCT GAAGTATCTG CCCTTCTTGG TCGTATGCCA 700 

TCTGCGGTAG GTTACCAACC AACACTTGCA ACAGAAATGG GTCAATTACA 750 

AGAGCGTATT ACATCTACAA ATAAAGGGTC TATCACGTCT 7 90 



2) INFORMATION FOR SEQ ID NO: 1264 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 4 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1232 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1264 

AGTTGCACTT CATTTAGGTG ATGATACAGT TCGTACAGTT GCGATGTCTT . 50 

CCACAGATGG ACTTGTTCGT GGCACAGAAG TAGAAGATAC TGGTAAACCA 100 

ATCTCTGTAC CAGTTGGTGA TGTAACACTT GGTCGCGTAT TTAACGTATT 150 

AGGTGATGCA ATTGACTTAG ATGGTGAGGT TCCTGCAGAT GTACATCGTG 200 

ATCCAATTCA CCGTCAAGCA CCTGCATTCG AAGAATTATC TACTAAAGTA 250 

GAAATTCTTG AAACTGGTAT TAAAGTAGTA GACTTACTTG CTCCTTACAT 300 

TAAGGGTGGT AAGATCGGCC TATTCGGTGG TGCCGGCGTA GGTAAAACAG 350 

TATTAATTCA GGAATTAATT AACAACATCG CACAAGAGCA CGGTGGTATC 400 

TCTGTATTCG CTGGTGTAGG TGAGCGTACT CGTGAGGGTA ATGACTTATA 450 

CCACGAAATG AGCGATTCTG GCGTAATCAA GAAAACTGCG ATGGTATTCG 500 

GACAAATGAA CGAGCCACCT GGAGCACGTC AACGTGTTGC ATTAACAGGT 550 

TTAACAATGG CTGAGCATTT CCGTGATGAG CAAGGACAAG ACGTACTTCT 600 

GTTCATCGAT AACATCTTCC GTTTCACGCA AGCGGGTTCT GAAG 64 4 



2) INFORMATION FOR SEQ ID NO: 1265 

(i) SEQUENCE CHARACTERISTICS: i 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: ATCC 4229 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1265 

GGCGGAAAGC TACCAGAAAT CTACAACGCC CTTACGGTAA AACAGAGCAA 50 

CGAAAACGGA ACAAGCATTA ACTTAACATT TGAAGTTGCA CTTCATTTAG 100 

GTGATGACAC AGTTCGTACA GTTGCAATGT CTTCCACAGA TGGACTTGTT 150 

CGTGGCACAG AAGTAGAAGA TACTGGTAAA GCAATCTCTG TACCAGTTGG 200 

TGATGCAACA CTTGGTCGTG TATTTAACGT ATTAGGTGAT GCAATTGACT 250 

TAGATGGTGA GGTTCCTGCG GATGTACGTC GTGATCCAAT TCACCGTCAA 300 

GCACCTGCAT TCGAAGAATT ATCTACTAAA GTAGAAATTC TTGAAACTGG 350 

TATTAAAGTA GTAGACTTAC TTGCTCCTTA CATTAAGGGT GGTAAGATCG 4 00 

GTCTATTCGG TGGTGCCGGT GTAGGTAAAA CGGTATTAAT TCAGGAATTA 4 50 

ATCAATAACA TCGCACAAGA ACACGGTGGT ATCTCTGTAT TCGCTGGTGT 500 

AGGTGAGCGT ACTCGTGAGG GTAATGACTT ATACCACGAA ATGAGCGATT 550 
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CTGGCGTAAT TAAGAAAACT GCGATGGTAT TCGGACAAAT GAACGAGCCA 
CCTGGAGCAC GTCAACGTGT TGCGTTAACA GGTTTAACAA TGGCTGAGCA 
TTTCCGTGAT GAGCAAGGAC AAGATGTACT TCTGTTCATC GATAATATCT 
TCCGTTTCAC GCAAGCAGGT TCTGAAGTAT CTGCCCTTCT TGGCCGTATG 
CCATCTGCGG TAGGTTACCA ACCAACACTT GCAACAGAAA TGGGTCAATT 
ACAAGAGCGT ATTACATCTA CAA 



600 
650 
700 
750 
800 
823 



2) INFORMATION FOR SEQ ID NO: 1266 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 715 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 

(B) STRAIN: ATCC 200443 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1266 

TGGTCCGAGR CCCGATTCMA TGAAATTATC AAGGAAACCT CCAAYTTCAT 50 

TAAGAAGGTC GGATATAACC CCAAGACTGT TCCTTTCGTT CCCATTTCTG 100 

GTTTCCAGGG TGACAACATG ATCGATSCCT CTGCCAACTG CCCATGGTAC 150 

AAGGGCTGGT ACMAKGAGAC TGCCGACAGG GAAGYACTCT GGCAAGACCC 200 . 

TTCTTGAGGC CATTGACGSC ATTGAGCCCC CCAMSCGTCC TWCCGATAAA 250 

CCTCTCCGTC TTCCTCTCCA GGATGTCTAC AAGATCTCCG GTATTGGMAC 300 

TGTTCCTGTC GGACGTRTTG AGACTGGAGT CATCAAGCCC GGTATGGTCG 350 

TGACCTTCGC TCCCGCCAAC GTCACCACTG AAGTCAAGTC CGTTGAAATG 400 

CACCACCAGC AGCTTTCCGA CGGTAWCCCC GGTGACAACG TCGGCTTCAA 450 

CGTCAAGAAT GTTTCCGTCA AAGAAGTCCG CCGTGGTAAC GTTGCCTGGT 500 

GACTCTAAGA ATGATCCCGC MAWGGGCTGC GATTCCTTCA ATGCYCAGGT 550 

CATCGTCCTC AACCACCCTG GTCAGGTTGG CGCTGGTTAT GCCCCAGTCC 600 

TCGAYTGCCA TACTGCCCAC ATTGCYTGCA ARTTCGCTGA GMTCMAKGAG 650 

AAGATTGAYC GCCGAACCGG MAAGTCTGTT GAGAACGCCC CCAAGTTCAT 700 

CAAGTCCGGT GATGC 715 



2) INFORMATION FOR SEQ ID NO: 1267 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 875 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastomyces dermatitidis 

(B) STRAIN: ATCC 56220 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1267 

GAGTCCTCTT ATTTACTTTT GTCATGACTA CCTTACTAAT CTGTCATAGA 50 

TCGTTACAAC GAAATCGTCA AGGAGACTTC CAACTTCATC AAGAAGGTCG 100 

GATACAACCC CAAGAACGTT CCTTTCGTTC CTATCTCCGG TTTCAACGGC 150 

GACAACATGC TTGAGCCCTC CCCCAACTGC CCCTGGTACA AGGGTTGGGA 200 

GAAGGAGACC AAGGCCGGTA AGGTCACTGG TAAGACCCTC CTCGAGGCCA 250 

TCGACGCCAT TGAGCCCCCT ACCCGTCCCG CCAACAAGGT CAGTACTACC 300 

TCAATTACTT GAACTCTCTT CATACGTTCC GATTACTGAC TGCTTCACAG 350 

CCCCTCCGTC TTCCCCTCCA GGACGTTTAC AAGATCGGTG GTATTGGAAC 400 

GGTGCCCGTC GGTCGTGTTG AGACCGGTAC CATCTCCCCT GGTATGGTCG 450 

TTACCTTGTA TGTATCCTGA CCATCCCCCT TGGCAATCAT TACGTACTAA 500 

CTCACTCTTC AGCGCTCCCG CCAACGTCAC CACTGAAGTC AAGAGTGTTG 550 

AAATGCACCA CCAGCAGCTC GCTGCCGGTC AGCCCGGTGA CAACGTTGGT 600 

TTCAACGTGA AGAACGTCTC CGTCAAGGAA ATCCGTCGTG GTAACGTTGC 650 

TGGTGATAGC AAGAACGACC CCCCTGCCGG TGCTGCTTCC TTCAACGCCC 700 

AGGTCATCGT CCTCAACCAC CCCGGTCAGG TCGGTGCTGG TTACGCCCCA 750 

GTCCTTGACT GCCACACTGC CCACATTGCT TGCAAGTTCT CTGAACTCCT 800 

TGAGAAGATT GACCGTCGTA CCGGAAAGTC TGTTGAGGAC CACCCCAAGT 850 

TCATCAAGTC CGGTGACGCT GCCAT 875 



2) INFORMATION FOR SEQ ID NO: 1268 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1124 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulatum 

(B) STRAIN: WSA-377 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1268 

GTGAGCGTGG TATCACCATC GATATTGCCC TCTGGAAATT CGAGACCCCG 50 

AAGTACAGTG TCACTGTCAT TGGTGAGTGC TTTTTACCCC TCTTAAGCAG 100 

ATTTCAACTT CCAGAGTATC TACTCTAACA TATCCGCTTA GATGCTCCCG 150 

GCCATCGTGA CTTCATCAAG AACATGATCA CTGGTACCTC CCAGGCTGAC 200 

TGCGCTATCC TCATCATTGC TGCCGGTACT GGTGAGTTCG AGGCTGGTAT 250 

CTCCAAGGAT GGCCAGACTC GTGAGCACGC TCTGCTTGCT TTCACCCTTG 300 

GTGTGAGGCA ACTCATCGTT GCCATCAACA AGATGGACAC CACCAAGTGG 350 

TCCGAGTCCC GTTTCAACGA AATCATCAAG GAGGTTTCCA ACTTCATCAA 400 

GAAGGTCGGA TATAACCCCA AGGCTGTTCC CTTCGTGCCA ATCTCTGGTT 450 

TCGAGGGTGA CAACATGATT GAACCCTCCC CCAACTGCAC CTGGTACAAG 500 
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GGCTGGAACA AGGAGACTGC CTCTGGCAAG TCTTCTGGTA AAACCCTTCT 550 

CGATGCCATT GACGCCATTG AACCCCCAAC CCGTCCTACC GATAAGCCTC 600" 

TCCGTCTTCC CCTCCAGGAT GTTTACAAAA TCTCTGGTAT TGGCACTGTT 650 

CCCGTCGGAC GTGTTGAGAC TGGTGTCATC AAGCCCGGTA TGGTCGTGAC 700 

5 TTTCGCTCCC TCCAACGTCA CCACTGAAGT CAAGTCCGTC GAGATGCACC 750 

ACCAACAACT CCAGGCTGGT TACCCTGGTG ACAACGTCGG CTTCAACGTC 800 

AAGAACGTTT CAGTCAAGGA AGTCCGCCGT GGCAACGTTG CTGGCGACTC 850 

CAAAAATGAT CCCCCCAAGG GCTGCGAATC CTTCAATGCC CAGGTCATCG 900 

TCCTTAACCA CCCCGGCCAG GTTGGCGCTG GTTATGCCCC AGTCCTCGAC 950 

10 TGCCACACTG CCCACATTGC TTGCAAGTTC TCTGAACTCA TTGAGAAGAT 1000 

CGACCGCCGT ACTGGAAAGT CTGTTGAGAA CAACCCCAAG TTCATCAAGT 1050 

CTGGTGATGC TGCTATCGTC AAGATGGTTC CCTCCAAGCC CATGTGCGTG 1100 

GAGCCCTTCA CTGACTATCC CCCT 1124 

15 

2) INFORMATION FOR SEQ ID NO: 1269 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 1043 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(.vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton rubrum 

(B) STRAIN : t WSA-224 



30 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1269 



GTGAGCGTGG TATCACCATC GATATCGCCC TCTGGAAGTT CGAGACCCCC 50 

AAGTACAATG TCACCGTCAT TGGTATGTTT CTTTGCCTTG TTCCCTCATG 100 

35 TGGTTGTACC ATATCTAACG AGAGTAGACG CCCCCGGTCA CCGTGACTTC 150 

ATCAAGAACA TGATCACTGG TACCTCCCAG GCTGACTGCG CTATTCTCAT 200 

CATTGCTGCC GGTACTGGTG AGTTCGAGGC TGGTATCTCC AAGGATGGCC 250 

AGACCCGTGA GCACGCTCTG CTCGCCTTCA CCCTCGGTGT CAAGCAGCTC 300 

ATCGTTGCCA TCAACAAGAT GGACACCACC GGCTGGTCCG AGGATCGTTT 350 

40 CAAGGAAATT ATCAAGGAAG TCACCAACTT CATCAAGAAG GTTGGCTACG 400 

ACCCCAAGGG TGTTCCATTC GTTCCAATCT CTGGTTTCAA CGGTGACAAC 450 

ATGATTGAGG CCTCCACCAA CTGCCCATGG TACAAGGGAT GGAACAAGGA 500 

GACCAAGGCC GGTGGTGCCA AGTCCGGCAA GACCCTCCTC GAGGCCATCG 550 

ATGCCATCGA CATGCCAACC CGTCCTACCG ACAAGCCCCT CCGTCTCCCA 600 

45 CTCCAGGATG TCTACAAGAT CTCTGGTATC GGAACTGTGC CAGTCGGTCG 650 

TGTTGAGACC GGTATCATCA AGCCCGGTAT GGTCGTCACC TTNGCCCCCG 700 

CCAACGTCAC CACTGAAGTC AAGTCCGTYK AAATGCACCA CCAGCAGCTT 750 

CAGCAGGGTG TCCCCGGTGA CAACGTCGGC TTCAATGTCA AGAACGTTTC 800 

CGTCAAGGAA GTCCGCCGTG GTAACGTTGC CGGTGACTCC AAGAACGACC 850 

50 CACCATCCGG CTGTGCCTCC TTCAACGCCC AGGTCATYGT CCTCAACCAC 900 

CCCGGCCAGA TCGGTGCTGG TTACGSTCCA GTCCTCGACT GCCACACTGS 950 

TCACATTGCT TGCAAGTTCG CTGAGCTCCT CGAGAAGATT GACCGCCGTA 1000 

CCGGTAAATC CGTCGAAGCC AACCCCAAGT TCGTCAAGTC TGG 104 3 
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2) INFORMATION FOR SEQ ID NO: 1270 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1105 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microsporum canis 

(B) STRAIN: WSA-217 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1270 

GCTGAGCGTG AGCGTGGTAT CACCATTGAT ATCGCCCTCT GGAAGTTCGA 50 

GACCCCCAAG TACATGGTCA CCGTCATCGG TATGCTTTAT CTGTTTCCCA 100 

TTTATAGTTG CGACCAGTAA CTAACAAAAA GTAGATGCCC CCGGGCACCG 150 

TGACTTCATC AAGAACATGA TTACTGGTAC CTCCCAGGCC GACTGCGCTA 200 

TTCTCATCAT TGCTGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG 250 

GATGGCCAGA CTCGTGAGCA CGCCCTGCTC GCTTTCACCC TCGGTGTCAA 300 

GCAGCTCATC GTTGCCATCA ACAAGATGGA CACCACCAAC TGGTCTGAGT 350 

CCCGTTTCGG TGAAATCATC AAGGAAGTCA CCAACTTCAT CAAGAAGGTC 400 

GGCTACGACC CCAAGGGTGT CCCATTCGTC CCAATCTCTG GCTTCAACGG 450 

TGACAACATG ATTGAGCCCT CCACCAACTG CCCATGGTAC AAGGGATGGA 500 

ACAAGGAGAC CAAGGCCGGT GGCAAATCCT CTGGTAAGAC CCTCCTTGAG 550 

GCCATCGATG CCATTGACAT GCCCACTCGT CCCACCGACA AGCCTCTCCG 600 

TCTCCCACTC CAGGATGTCT ACAAGATCTC TGGTATCGGA ACAGTACCAG 650 

TCGGTCGTGT TGAGACTGGT ATCATCAAGC CTGGTATGGT TGTCACTTTY 700 

GCCCCCGCCA ACGTCACCAC TGAAGTCAAG TCCGTCGAAA TGCACCACCA 750 

GCAGCTYGTC CAGGGTGTTC CCGGTGACAA CGTTGGCTTC AACGTCAAGA 800 

ACGTYTCTGT CAAGGAAGTC CGCCGTGGTA ACGTTGCCGG TGATTCCAAG 850 

AACGACCCAC CAGCTGGCTG CGCCTCTTTC AAGGCCCAGG TCATCGTCCT 900 

CAACCACCCC GGCCAGATCG GTGCTGGTTA CGCCCCAGTC CTTGACTGCC 950 

ACACTGCCCA CATTGCTTGC AAGTTCTCTG AGCTTCTTGA GAAGATTGAC 1000 

CGCCGTACTG GTAAATCCGT CGAAACCAGC CCTAAGTTCG TCAAGTCTGG 1050 

TGATGCCGCT ATTGCCACCA TGGTTCCATC CAAGCCCATG TGCGTTGAGG 1100 

CTTTC 1105 



2) INFORMATION FOR SEQ ID NO: 1271 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1244 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Aspergillus versicolor 

(B) STRAIN: WSA-175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1271 

5 

GAGCGTGAGC GTGGTATCAC CATCGATATC GCTCTCTGGA AGTTCCAGAC , 50 

CCCTAAGTAT GAGGTCACCG TCATTGGTAT GTTGTCCTTC TTGTGTTACC 100 

ATCGAAACAT ATCTAACCTA CAACTGCAGA CGCCCCCGGT CACCGTGACT 150 

TCATCAAGAA CATGATCACT GGTACCTCCC AGGCCGACTG CGCTATTCTC 200 

10 ATCATTGCTT CCGGTACTGG TGAATTCGAG GCTGGTATCT CCAAGGATGG 250 

CCAGACCCGT GAGCACGCTC TGCTCGCTTT CACCCTCGGT GTCCGTCAGC 300 

TCATCGTTGC CCTCAACAAG ATGGACACTG CTGGCTGGGC TGAGGCTCGT 350 

TACAACGAAA TCGTCAAGGA AACTTCCGGT TTCATCAAGA AGGTCGGCTA 400 

CAACCCCAAG TCGGTTCCCT TCGTCCCCAT CTCCGGTTTC AACGGTGACA 450 

15 ACATGCTTGA GCCCTCCTCC AACTGCCCCT GGTACAAGGG TTGGGAGAAG 500 

GAGACCAAGG CTGGTAAGGC CACTGGTAAG ACCCTCCTCG AGGCCATCGA 550 

CGCCATTGAG CCTCCCGTCC GTCCCTCCAA CAAGCCTCTC CGTCTTCCCC 600 

TCCAGGATGT CTACAAGATC TCTGGTATTG GAACTGTCCC CGTCGGCCGT 650 

GTCGAGACCG GTACCATCGT CCCCGGTATG GTCGTCACCT TCGCTCCCGC 700 

20 CAACGTCACC ACTGAAGTCA AGTCCGTTGA GATGCACCAC CAGCAGCTCA 750 

AGGAGGGTGT TCCCGGTKAC AACGTTGGTT TCAACGTGAA GAACGTTTCC 800 

GTCAAGGAAG TCCGCCGTGG TAACGTCGCT GGTGACTCCA AGAACGACCC 850 

CCCTGCCGGT GCTGCCTCTT TCACCGCCCA GGTCATCGTC CTCAACCACC 900 

CCGGTCAGGT CGGCGCTGGT TACGCTCCCG TCCTCGACTG CCACACCGCT 950 

25 CACATTGCCT GCAAGTTCGC TGAGCTCCAG GAGAAGATCG ACCGCCGTAC 1000 

CGGAAAGTCT GTCGAATYTG CCCCCAAGTT CATCAAGTCT GGTGACGCCG 1050 

CTATCGTCAA GATGATTCCC TCCAAGCCCA TGTGTGTCGA GTCTTTCACT 1100 

GACTACCCTC CTYTCGGCCG TTTCGCCGTC CGTGACGTAA GTTCTTTCCC 1150 

CAGCTTTTCG ATGCTACCCT TCTMTGAATC ACGTGTCATG TCTTGGCACC - 1200 

30 CGCCCATCAC ATGACCACGC AACCCTATAC CCCGCCACAC CCTT 1244 



2) INFORMATION FOR SEQ ID. NO: 1272 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Exophiala moniliae 

(B) STRAIN: WSA-219 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1272 

50 GCTGAGCGTG AGCGTGGTAT CACCATCGAT ATCGCTCTCT GGAAGTTCGA 50 

GACCCCCAAG TACTATGTCA CCGTCATCGA CGCCCCCGGT CATCGTGACT 100 

TCATCAAGAA CATGATCACT GGTACTTCCC AAGCTGACTG CGCCATTCTC 150 

ATCATTGCTG CCGGTACTGG TGAATTCGAA GCCGGTATCT CCAAGGATGG 200 
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TCAGACCCGT GAGCACGCTC TGCTTGCCTA CACCCTGGGT GTCAAGCAGC 250 

TCATTGTCGC CATCAACAAG ATGGACACTA CCAAGTGGTC TGAGGACCGT 300 

TTCAACGAAA TCATCAAGGA GACTTCCAGC TTCATCAAGA AGGTCGGCTA 350 

CAACCCCAAG TCCGTTCCTT TCGTCCCCAT CTCCGGCTTC AACGGTGACA 400 

5 ACATGATCGA CGTCTCCACC AACTGCCCCT GGTACAAGGG CTGGGAGAAG 4 50 

GAGACCAAGG CTGGCAAGGC CTCTGGCAAG ACTCTCCTTG AGGCCATCGA 500 

CGCCATTGAC CCCCCCTCTC GTCCCACCGA CAAGCCTYTC CGTCTCCCTC 550 

TCCAGGATGT GTACAAGATC TCTGGTATCG GAACGGTGCC CGTCGGTCGT 600 

GTCGAGACTG GTATCATCAA GGCCGGTATG GTCGTTACCT TCGCTCCTGC 650 

10 CAACGTCACC ACTGAAGTCA AGTCCGTCGA AATGCACCAC GAACAGCTCG 700 

CCGAGGGTGT TCCAGGTGAC AACGTCGGTT TCAACGTCAA GAACGTYTCC 7 50 

GTCAAGGAGG TTCGTCGTGG AAACGTTTGC GGTGACTCCA AGAACGACCC 800 

ACCCAAGGGC GCTGATTCCT TCAACGCCCA GGTCATCGTC TTGAACCACC 850 

CTGGTCAAGT TGGTGCTGGC TACGCCCCAG TGTTGGATTG CCACACTGCC 900 

15 CACATTGCTT GCAAGTTCTC TGAGCTTCTC GAGAAGATCG ATCGTCGKAC 950 

CGGAAAGTCG ATCGAAAACA ACCCCAAGTT CATCAAGTCT GGTGACGCTG 1000 

CCATCGTCAA GATGGTTCCC AGCAAGCCCA TG 1032 



20 



2) INFORMATION FOR SEQ ID NO: 1273 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1106 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Hortaea werneckii 

(B) STRAIN: ATCC 34944 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1273 

TAACAGCCCG TGCGACCGTC TCGTCCAATA ACTAATGATT CGCAGGTACT 50 

ATGTCGTGAG TATCCGGTCC TTTTTTGTTA ATTTACCAGA AATGACGAAA 100 

TTTTTGACTA ATTAACACAC TCAGACCGTC ATTGACGCCC CGGGTCACCG 150 

4 0 TGACTTCATC AAGAACATGA TCACTGGTAC CTCCCAGGCC GACTGCGCTG 200 

TCCTCATCAT TGCTGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG 250 

GATGGCCAGA CCCGTGAGCA CGCCCTGCTC GCCTACACCC TCGGTGTCAA 300 

GCAGCTCATC GTCGCCATCA ATAAGATGGA CACCACCAAG TGGTCCGAGG 350 

AGCGTTACGG CGAGATCATC AAGGAGACCT CTGCCTTCAT CAAGAAGGTC 400 

4 5 GGTTTCAACC CGAAGCACGT CCCGTTCGTC CCGATCTCCG GTTTCAACGG 4 50 

TGACAACATG ATCGAGGCCT YCACCAACTG CCCGTGGTAC AAGGGCTGGG 500 

AGAAGGAGAC CAAGGCCAAG GTCACCGGCA AGACCCTYCT TGAGGCCATT 550 

GACAACATCG ACCCGCCGAG CCGTCCTTCC GACAAGCCGC TCCGTCTTCC 600 

CCTCCAGGAT GTCTACAAGA TCGGTGGTAT TGGGACAGTC CCAGTCGGCC 650 

50 GTGTCGAGAC CGGTACCATC AAGGCCGGCA TGGTCGTTAC CTTCGCTCCG 700 

GCTGGTGTCA CCACTGAAGT GAAGTCCGTT GARATGCACC ACGAGCAGCT 750 

CGCTGAGGGT YTGCCGGGTG ACAACGTCGG CTTCAACGTC AAGAACGTTT 800 

CCGTCAAGGA GATCCGTCGT GGCAACGTTG CTGGTGACAG CAAGGCTGAC 850 
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CCGCCGAAGG GCTGCGACAG CTTCAACGCC 

CCCTGGCCAG GTCGGTGCTG GTTACGCTCC 

CCCACATTGC CTGCAAGTTC GGCGAGCTCC 

TYTGGCAAGT CCATTGAAGC CTYGCCTAAG 

5 TGCCATYGTC AAGATGATTC CGTCCAAGCC 
CTGAGT 
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CAGGTCATCG TCCTGAACCA 900 
AGTCCTGGAC TGCCACACTG 950 
TCGAGAAGAT CGACCGTCGC 1000 
TACATCAAGT CTGGTGACGC 1050 
GATGTGCGTT GAGCCATTCA 1100 

1106 



10 2) INFORMATION FOR SEQ ID NO: 1274 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1119 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium solani 

(B) STRAIN: ATCC 32793 



25 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1274 



CTCAAGGCCG AGCGTGAGCG TGGTATCACC ATCGACATTG CCCTCTGGAA 50 

GTTCGAGACT CCCCGCTACT ATGTCACCGT CATTGGTATG TTGCTGTCAC 100 

CTCTCTCACA CATGTCTCAC CACTAACAAT CAACAGACGC CCCCGGCCAC 150 

CGTGATTTCA TCAAGAACAT GATCACTGGT ACTTCCCAGG XCGACTGCGC 200 

30 CATTCTCATC ATTGCCGCCG GTACTGGTGA GTTCGAGGCT GGTATCTCCA 250 

AGGATGGCCA GACCCGTGAG CACGCCCTGC TCGCCTACAC CCTCGGTGTC 300 

AAGAACCTCA TTGTCGCCAT CAACAAGATG GACACCACCA AGTGGTCCGA 350 

GTCCCGTTAC CAGGAGATCA TCAAGGAGAC CTCCAACTTC ATCAAGAAGG 4 00 

TCGGCTACAA CCCCAAGGCT GTCGCTTTCG TCCCCATCTC CGGTTTCAAC 450 

35 GGCGACAACA TGCTTACTCC CTCCACCAAC TGCCCCTGGT ACAAGGGCTG 500 

GGAGCGTGAG ATCAAGTCCG GCAAGCTCAC TGGCAAGACC CTCCTCGAGG 550 

CCATTGACTC CATCGAGCCC CCCAAGCGTC CCGTCGACAA GCCCCTYCGA 600 

CTTCCCCTCC AGGATGTCTA CAAGATCGGT GGTATTGGCA CGGTTCCCGT 650 

CGGCCGTATC GAGACTGGTG TCATCAAGCC CGGTATGGTC GTTACCTTMG 700 

40 CCCCCTCCAA CGTCACCACT GAAGTCAAGT CCGTCGARAT GCACCACGAG 750 

CAGCTCTYTG AGGGTCTTCC CGGTGACAAC GTCGGCTTCA ACGTGAAGAA 800 

CGTYTCCGTC AAGGAGATCC GACGTGGCAA CGTCGCTGGT GACTCCAAGA 850 

ACGACCCCCC TYTGGGTGCC GCCTCTTTCA CCGCCCAGGT CATTGTCCTC 900 

AACCACCCTG GCCAGGTCGG TGCCGGTTAC GCCCCCGTTY TGGACTGCCA 950 

45 CACTGCCCAC ATTGCCTGCA AGTTCGCCGA GATCCAGGAG AAGATCGACC 1000 

GCCGAACTGG TAAGGCTGTT GAGTCCGCCC CCAAGTTCAT CAAGTCTGGT 1050 

GACTCCGCCA TCGTCAAGAT GGTTCCCTCC AAGCCCATGT GCGTTGAGGC 1100 

TTTCACTGAC TACCCCCCT 1119 



2) INFORMATION FOR SEQ ID NO: 1275 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Aureobasidium pullulans 

(B) STRAIN: WSA-234 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1275 

15 CTGAAGTCTG AGCGTGAGCG TGGTATCACT ATCGATATCG CCCTCTGGAA 50 

GTTCGAGACC CCCAAGTACA TGGTCACCGT CATCGATGCC CCCGGTCACC 100 

GTGATTTCAT CAAGAACATG ATCACTGGTA CCTCCCAGGC TGACTGCGCC 150 

ATTCTCATCA TTGCTGCCGG TACTGGTGAG TTCGAGGCTG GTATCTCCAA 200 

GGATGGCCAG ACTCGTGAGC ACGCTCTCCT CGCCTACACC CTTGGTGTCA 250 

20 AGCAGCTCAT CGTTGCCATC AACAAGATGG ACACCACCAA GTGGTCTGAG 300 

GCCCGTTACC AGGAGATCAT CAAGGAGACC TCCGGTTTCA TCAAGAAGGT 350 

CGGCTACAAC CCCAAGCACG TTCCCTTCGT CCCCATCTCC GGCTTCAACG 400 

GAGACAACAT GATCGAGGTC AGCTCCAACT GCCCTTGGTA CAAGGGTTGG 450 

GAGAAGGAGA CCAAGGCCAA GGCCACTGGC AAGACCCTCC TCGAGGCCAT 500 

25 TGACGCCATC GACCCTCCTT CCCGTCCTAC CGACAAGCCC CTCCGTCTTC 550 

CCCTCCAGGA TGTCTACAAG ATCGGTGGTA TTGGCACGGT GCCCGTCGGT 600 

CGTGTTGAGA CCGGTAAGAT CATGGGTGGT ATGGTTGTCA CCTTCGCCCC 650 

CGCTGGTGTC ACCACCGAGG TCAAGTCCGT CGAGATGCAC CACGAGCAGC 700 

TCACCGAGGG TCTTCCCGGT GACAACGTCG GCTTCAACGT CAAGAACGTY 750 

30 TCCGTCAAGG AGATCCGTCG TGGCAACGTT GCCGGTGACT CCAAGAACGA 800 

CCCCCCCAAG GGTTGTGACT CCTTCAACGC CCAGGTCATC GTCCTGAACC 850 

ACCCTGGTCA GGTCGGTGCT GGTTACGCAC CCGTCCTCGA CTGCCACACT 900 

GCCCACATTG CTTGCAAGTT YTCCGAGCTT GTTGAGAAGA TTGACCGCAG 950 

AACCGGCAAG TCCGTTGAGG CTGCCCCCAA GTTCATCAAG TCTGGTGACG 1000 

35 CCGCCATCGT CAAGATGGTT CCCTCCAAGC CCATGTGTGT TGAGGCTTTC 1050 

ACCGACTACC CTCCTYTCGG TCGTTTCGCC GTCCGGTATG TTTTTCTTCT 1100 

TCAATCATTC TTC 1113 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1276 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 731 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastomyces derma titldis 

(B) STRAIN: ATCC 14112 

691 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(xih SEQUENCE DESCRIPTION: SEQ ID NO: 1276 

GGTCTGAGAC CCGTTTCAAC GAAATTATCA AGGAAGTCAG CAACTTCATC 50 

5 AAGAAGGTCG GATACAACCC CAAGTCTGTT CCCTTCGTGC CAATCTCCGG 100 

TTTCGAGGGT GACAACATGA TTGAGCCTTC CCCCAACTGC CCTTGGTACA 150 

AGGGCTGGAA CAAGGAGACT GCTGCTGGCA AGGCCGCCGG TAAGACTCTT 200 

CTCGATGCCA TTGACGCCAT CGACCCCCCC GTCCGTCCTA CCGAGAAGCC 250 

TCTCCGTCTT CCCCTCCAGG ATGTGTACAA GATCTCCGGT ATTGGCACTG 300 

10 TTCCCGTTGG ACGTGTCGAG ACTGGTGTCA TCAAGCCTGG TATGGTCGTG 350 

ACCTTCGCCC CTGCCAACGT CACCACTGAA GTCAAGTCCG TTGAAATGCA 400 

CCACCAGCAG CTCCAGGCTG GTTACCCTGG TGACAACGTC GGCTTCAACG 450 

TCAAGAACGT TTCCGTCAAG GAAGTCCGCC GTGGCAACGT TGCCGGTGAC 500 

TCCAAGAACG ACCCCCCCAA GGGCTGCGAG TCCTTCAACG CCCAGGTCAT 550 

15 CGTCCTCAAC CACCCCGGCC AGGTTGGCGC TGGTTATGCC CCAGTCCTCG 600 

ACTGCCACAC TGCCCACATT GCTTGCAAGT TCTCTGAGCT CATCGAGAAG 650 

ATTGACCGCC GTACCGGAAA GTCTGTTGAG GACAACCCCA AGTTCATCAA 700 

GTCCGGTGAT GCTGCTATCG TCAAGATGAT T 731 



20 



35 



2) INFORMATION FOR SEQ ID NO: 1277 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1046 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double,. 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Exophiala dermatitidis 

(B) STRAIN: ATCC 76088 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1277 



CCGAGCGTGA GCGTGGTATC ACCATCGATA TCGCCCTCTG GAAGTTCGAG 50 

ACCCCCAAGT ACTATGTCAC CGTCATCGAC GCCCCGGGTC ATCGTGACTT 100 

40 TATCAAGAAC ATGATCACTG GTACCTCGCA GGCCGACTGC GCCATCTTGA 150 

TCATTGCCGC CGGTACCGGT GAATTCGAAG CCGGTATCTC CAAGGATGGT 200 

CAGACCCGTG AGCACGCTCT GCTCGCCTAC ACCTTGGGTG TCAAGCAGCT 250 

CATCGTCGCC ATCAACAAGA TGGACACCAC CAAGTGGTCC GAGGAGCGTT 300 

TCAACGAAAT CATCAAGGAG ACTTCCAACT TCATCAAGAA GGTCGGCTAC 350 

45 AACCCCAAGG CCGTTCCTTT CGTCCCCATC TCCGGCTTCA ACGGTGACAA 400 

CATGATTGAG GTCTCCACCA ACTGCCCGTG GTACAAGGGA TGGGAGAAGG 450 

AGTCCAAGGC TGGCAAGGCC ACCGGCAAGA CCCTCCTCGA GGCCATTGAC 500 

GCCATCGACC CACCCACCCG TCCCACCGAC AAGCCTCTCC GTCTCCCTCT 550 

CCAGGATGTC TACAAGATCT CTGGTATCGG AACGGTTCCT GTCGGTCGTG 600 

50 TCGAGACCGG TACCATCAAG GCCGGTATGG TCGTCACCTT CGCTCCGGCC 650 

AACGTCACCA CTGAAGTCAA GTCCGTCGAA ATGCACCACG AGCAGCTCGC 700 

CGAGGGTYTG CCAGGTGACA ACGTTGGCTT CAACGTCAAG AACGTYTCCG 750 

TCAAGGAGGT TCGTCGTGGT AACGTTGCCG GTGACTCCAA GAACGACCCG 800 

692 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

CCCAAGGGTG CCGAGTCCTT CAACGCCCAG 
TGGTCAGATC GGTGCCGGCT ACGCTCCAGT 
ACATTGCTTG CAAGTTCGCC GAGTTGCTCG 
GGAAAGTCCA TCGAGAACAA CCCCAAGTTC 
5 CATCGTCAAG ATGATTCCCA GCAAGCCCAT 



PCT/CA00/01150 

GTCATTGTCC TCAACCACCC 850 

CTTGGATTGC CACACTGCCC 900 

AGAAGATCGA CCGTCGTACC 950 

ATCAAGTCTG GTGATGCTGC 1000 

GTGTGTCGAG GCTTTC 104 6 



2) INFORMATION FOR SEQ ID NO: 1278 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1109 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 
15 (D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Fusarium moniliforme 

(B) STRAIN: WSA-213 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1278 



25 CGTGAGCGTG GTATCACCAT CGATATTGCT CTCTGGAAGT TCGAGACTCC 50 

TCGCTACTAT GTCACCGTCA TTGGTATGTT GTCGCTCATA CCTCATCCTA 100 

CTTCCTCATA CTAACACATC ATTCAGACGC TCCCGGTCAC CGTGATTTCA 150 

TCAAGAACAT GATCACTGGT ACTTCCCAGG CCGATTGCGC CATTCTCATC 200 

-ATTGCCGCCG GTACTGGTGA GTTCGAGGCT GGTATCTCCA AGGATGGCCA 250 

30 GACCCGTGAG CACGCTCTTC TTGCCTACAC CCTTGGTGTC "AAGAACCTCA 300 

TCGTCGCCAT CAACAAGATG GACACCACCA AGTGGTCTGA GGCCCGTTAC 350 

CAGGAGATCA TCAAGGAGAC CTCCTCTTTC ATCAAGAAGG TCGGCTACAA 400 

CCCCAAGGCT GTCGCTTTCG TCCCCATCTC CGGTTTCAAC GGTGACAACA 4 50 

TGCTTACCCC CTCCACCAAC TGCCCCTGGT ACAAGGGTTG GGAGCGTGAG 500 

35 ATCAAGTCCG GCAAGCTCTC CGGCAAGACC CTCCTCGAGG CCATTGACTC 550 

CATCGAGCCT CCCAAGCGTC CCGTTGACAA GCCCCTCCGT CTTCCCCTCC 600 

AGGATGTCTA CAAGATCGGT GGTATTGGAA CGGTTCCCGT CGGCCGTATT 650 

GAGACTGGTG TCATCAAGCC CGGTATGGTC GTTACCTTCG CTCCCTCCAA 700 

CGTCACCACT GAAGTCAAGT CCGTCGARAT GCACCACGAG CAGYTCAGTK 750 

40 AGGGCCAGCC CGGTGACAAC GTTGGTTTCA ACGTGAAGAA CGTTTCCGTC 800 

AAGGACATCC GACGTGGTAA CGTCGCTGGT GACTCCAAGA ACGACCCCCC 850 

CCAGGGTGCC GCTTCTTTCA CCGCCCAGGT CATCGTCCTC AACCACCCCG 900 

GCCAGGTCGG TGCTGGTTAC GCTCCTGTCC TCGATTGCCA CACTGCCCAC 950 

ATTGCCTGCA AGTTCGCCGA GATCCAGGAG AAGATCGACC GCCGAACCGG 1000 

45 TAAGGCTACT GAGGCCGCTC CCAAGTTCAT CAAGTCTGGT GACTCCGCCA 1050 

TCGTCAAGAT GGTTCCCTCC AAGCCCATGT GTGTCGAGGC TTTCACTGAC 1100 

TACCCTCCT 1109 



50 

2) INFORMATION FOR SEQ ID NO: 1279 

(i) SEQUENCE CHARACTERISTICS: 

693 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(A) LENGTH: 765 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus terreus 
10 (B) STRAIN: WSA-174 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1279 

CCTGCAAGTG GTCTGAGGAC CGTTACAACG AAATCGTGAA GGAGACCTCC 50 

15 AACTTCATCA AGAAGGTCGG CTACAACCCC AAGGCCGTTC CCTTCGTCCC 100 

CATCTCCGGT TTCAACGGTG ACAACATGCT TGAGCCTTCC CCCAACTGCC 150 

CCTGGTACAA GGGTTGGGAG AAGGAGGGCA AGTCCGGCAA GGTCACCGGT 200 

AAGACTCTCC TCGAGGCCAT CGATGCCATC GAGCCCCCCG TCCGTCCTGC 250 

CAACAAGCCT CTCCGTCTTC CCCTCCAGGA TGTGTACAAG ATCTCTGGTA 300 

20 TCGGAACTGT CCCCGTCGGC CGTGTCGAGA CTGGTGTCAT CACCCCCGGC 350 

ATGGTTGTTA CCTTCGCTCC TTCCAACGTC ACCACTGAAG TGAAGTCCGT 400 

TGAGATGCAC CACCAGCAGC TCAAGGAGGG TCTCCCCGGT GACAACGTTG 4 50 

GTTTCAACGT CAAGAACGTC TCCGTCAAGG AGGTCCGTCG TGGTAACGTC 500 

GCTGGTGACT CCAAGAACGA CCCCCCTGCT GGCGCTGCCT CCTTCACCGC 550 

25 CCAGGTCATC GTTCTCAACC ACCCCGGTCA GGTCGGCGCT GGCTACGCCC 600 

CCGTCCTCGA CTGCCACACT GCCCACATTG CCTGCAAGTT . CGCTGAGCTC 650 

CAGGAGAAGA TTGACCGCCG TACCGGAAAG TCTGTTGAGT CTTCTCCCAA 700 

GTTCATCAAG TCTGGTGATG CTGCCATCGT CAAGATGATC CCTYCAAAGC 750 

CGATGTGCGT CGAAG 765 

30 



2) INFORMATION FOR SEQ ID NO: 1280 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1105 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 
45 (B) STRAIN: ATCC 64746 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1280 

GCGTGGTATC ACCATCGACA TTGCCCTCTG GAAGTTCCAG ACTCCCAAGT 50 

50 ATGAGGTCAC TGTCATCGGT AAGCTCGACT CGCCCCGATA TGTTTTGGTG 100 

CTGTAGCTAA CACGATCTGA AGATGCCCCC GGTCACCGTG ACTTCATCAA 150 

GAACATGATC ACTGGTACCT CCCAGGCTGA CTGCGCTATC CTCATCATTG 200 

CCTCCGGTAC TGGTGAGTTC GAGGCTGGTA TCTCCAAGGA TGGCCAGACC 250 

694 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

CGTGAGCACG CTCTGCTGGC TTTCACCCTC GGTGTCAAGC AGCTCATCGT 300 

CGCCCTCAAC AAGATGGACA CCTGCAAGTG GTCCGAGGAT CGTTACAACG 350 

AAATTGTCAA GGAAACCTCC AACTTCATCA AGAAGGTCGG CTACAACCCC 400 

AAGGCCGTTC CCTTCGTMCC CATCKCTGGC TTCAACGGTG ACAACATGCT 450 

5 TGAGCCCTCC TCCAACTGCC CCTGGTACAA GGGATGGGAG AAGGAGACCA 500 

AGGCCGGCAA GGTCACTGGT AAGACCCTCA TCGAGGCCAT YGACGCCATT 550 

GAGCCCCCTG TCCGTCCCTC CAACAAGCCC CTYCGTCTTC CCCTCCAGGA 600 

TGTYTACAAG ATCTCTGGTA TCGGAACGGT CCCTGTCGGC CGTGTCGAGA 650 

CCGGTATCAT CAAGCCCGGC ATGGTCGTCA CCTTTGCCCC CGCCAACGTC 700 

10 ACCACTGAAG TCAAGTCCGT CGAAATGCAC CACCAGCAGC TCCAGGAGGG 750 

TGTCCCCGGT GACAACGTCG GTTTCAACGT CAAGAACGTT TCCGTCAAGG 800 

AAGTCCSCCG TGGTAACGTT TGCGGTGACT CCAAGAACGA TCCCCCTCAG 850 

GGTGCTGCCT CCTTCAACGC CCAGGTCATC GTCCTCAACC ACCCCGGTCA 900 

GGTCGGCGCT GGTTACGCCC CCGTCCTCGA CTGCCACACT GCCCACATTG 950 

15 CTTGCAAGTT CTYTGAGCTG CTTGAGAAGA TTGACCGCCG TACCGGCAAG 1000 

TCTGTTGAGA ACAACCCCAA GTTCATCAAG TCCGGTGAWG CCGCCATCGT 1050 

GAAGATGGTT CCTTCCAAGC CCATGTGTGT CGAGTCCTTC ACTGACTACC 1100 

CCCCT 1105 

20 

2) INFORMATION FOR SEQ ID NO: 1281 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 1343 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus laurentii 

(B) STRAIN: ATCC 44096 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1281 



GCCGAGCGAG AGCGAGGTAT CACCATCGAC ATTGCTCTCT GGAAGTTCGA 50 

GACCCCCAAG TACAACGTCA CCGTCATTGA CGCCCCCGGA CACCGAGACT 100 

40 TCATCAAGAA CATGATCACC GGTACCTCCC AGGCCGACTG CGCCATCCTT 150 

ATCATTGCCA CCGGTATCGG AGAGTTCGAG GCCGGTATCT CTAAGGACGG 200 

TCAGACCCGA GAGCACGCTT TGCTCGCCTT CACCCTCGGT GTCAGGCAGC 250 

TCATCGTTGC TTGCAACAAG ATGGACACCT GCAAGTGGTC CGAGGACCGA 300 

TTCAACGAGA TCGTCAAGGA GACCAACGGT TTCATCAAGA AGGTCGGATA 350 

45 CAACCCCAAG GCTGTTCCCT TCGTCCCCAT CTCTGGATGG CACGGAGACA 400 

AGTGAGTGCC GTTCTTTGCG TTGAGCCCTC TTTGTCGCTC CCCCTCCCTC 450 

TCAAGTGGCG GCGGCGGTCT CCACCCACAA ATCGGGTGGC GAATCCGCCA 500 

CACCCACCAC TTCTCGCCAC CGAGTGTGGC ACTTCTTCCA ACTCCTCTTT 550 

CCACTCCTCC TCGTCTCGTC TCTTTTTTTC TCCGTTGTCT TTGACAAGGG 600 

50 GAGTGTGCTG ATAGTAAAGC ATGCTTGAGG AGACCACCAA CATGCCGTGG 650 

TACAAGGGAT GGACCAAGGA GACCAAGTCC GGTGTCGTTA AGGGTAAGAC 700 

CCTCCTCGAC GCCATCGACG CCATCGAGCC TCCTCAACGA CCCACCGACA 750 

AGCCCCTCCG ACTTCCCCTC CAGGATGTCT ACAAGATCGG TGGTATCGGT 800 

695 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



ACGGTGCCCG TCGGCCGAGT CGAGACCGGT 

CACGGGAGCC TGACGGCTGT TGTTGCCACA 

GTCGTCACCT TCGCTCCTAC CAACGTCACC 

GATGCACCAC GAGCAGATCC CTGAGGGTCT 

5 TCAACGTGAA GAACGTTTCC ATCAAGGACA 

TCCGACTCCA AGAACGACCC CGCTAAGGAG 

GGTCATTGTC CTCAACCACC CTGGACAGAT 

TCCTCGACTG CCACACCGCC CACATTGCCT 

GAGAAGATCG ACCGACGAAC TGGTAAGACC 

10 CGTCAAGTCC GGAGACGCCG CCATTGTCAA 

TCTGTGTCGA GTCTTACTCT GACTACCCTC 



GTCATCAAGG CCGGGTAAGT 850 

CCCAACTTAT ATCCAGTATG 900 

ACTGAGGTTA AGTCCGTTGA 950 

TCCCGGAGAC AACGTTGGTT 1000 

TCCGACGAGG AAACGTYTGC 1050 

GCCGCTTCTT TCAACGCCCA 1100 

TGGTGCCGGT TACACCCCCG 1150 

GCAAGTTCGC CGAGCTCATC 1200 

ATGGAGGCCG CCCCCAAGTT 1250 

GCTCGTTGCC CAGAAGCCCA 1300 

CCCTTGGACG ATT 1343 



15 2) INFORMATION FOR SEQ ID NO: 1282 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 734 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Emmonsia parva 

(B) STRAIN: ATCC 10784 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1282 



TGGTCCGAGG CTCGTTTCAA CGAAATCATC AAGGAAGTCA CCAACTTCAT 50 

CAAGAAGGTC GGATACAACC CCAAGTCTGT TCCCTTCGTG CCAATCTCTG 100 

GTTTCGAGGG TGATAACATG ATTGAGCCCT CCACCAACTG CCCCTGGTAC 150 

AAGGGCTGGA CCAAGGAGAC CGCTGCCGGC AAGTCAACCG GTAAGACTCT 200 

3 5 TCTTGACGCC ATTGATGCCA TCGATCAACC CTCCCGTCCT ACCGACAAGC 250 

CCCTCCGTCT TCCCCTCCAG GATGTGTACA AGATCTCCGG TATTGGCACT 300 

GTTCCCGTTG GACGTGTTGA GACTGGTATC ATCAAGCCTG GTATGGTCGT 350 

GACCTTCGCT CCCTCCAACG TCACCACTGA AGTCAAGTCC GTCGAAATGC 400 

ACCACCAACA GCTCTTGGCT GGTAACCCCG GTGACAACGT CGGTTTCAAC 450 

4 0 GTCAAGAACG TTTCCGTCAA GGAAGTCCGC CGTGGCAACG TTGCTGGTGA 500 

CTCAAAGAAC GACCCCCCCA AGGGCTGCGA CTCCTTCAAC GCCCAGGTCA 550 

TCGTCCTCAA TCACCCCGGT CAAGTTGGCG CTGGTTATGC CCCAGTCCTC 600 

GACTGCCACA CTGCCCACAT TGCTTGCAAG TTCTCTGAGC TCCTCGAGAA 650 

GATTGACCGC CGTACCGGAA AGTCCACTGA GAACAACCCC AAGTTCATCA 700 

45 AGTCTGGTGA CGCCGCTATC GTCAAGATGG TTCC 734 



2) INFORMATION FOR SEQ ID NO: 1283 

* 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1107 bases 

(B) TYPE: Nucleic acid 

696 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium solani 

(B) STRAIN: ATCC 62877 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1283 

CGTGAGCGTG GTATCACCAT CGACATTGCC CTCTGGAAGT TCGAGACTCC 50 

CCGCTACTAT GTCACCGTCA TTGGTATGTT GCTGTCACCT CTCTCACACA 100 

TGTCTCACCA CTAACAATCA ACAGACGCCC CCGGCCACCG TGACTTCATC 150 

15 AAGAACATGA TCACTGGTAC TTCCCAGGCC GACTGCGCCA TTCTCATCAT 200 

TGCCGCTGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 250 

CCCGTGAGCA CGCCCTGCTC GCCTACACCC TCGGTGTCAA GAACCTCATT 300 

GTCGCCATCA ACAAGATGGA CACCACCAAG TGGTCCGAGT CCCGTTACCA 350 

GGAGATCATC AAGGAGACCT CCAACTTCAT CAAGAAGGTC GGCTACAACC 400 

2 0 CCAAGGCTGT CGCTTTCGTC CCCATCTCCG GTTTCAACGG CGACAACATG 450 

CTTACTCCCT CCACCAACTG CCCCTGGTAC AAGGGCTGGG AGCGTGAGAT 500 

CAAGTCCGGC AAGCTCACTG GCAAGACCCT CCTCGAGGCC ATTGACTCCA 550 

TCGAGCCCCC CAAGCGTCCC GTCGACAAGC CCCTCCGACT TCCCCTCCAG 600 

GATGTCTACA AGATCGGTGG TATTGGCACG GTTCCCGTCG GCCGTATCGA 650 

25 GACTGGTGTC ATCAAGCCCG GTATGGTCGT TACCTTCGCC CCCTCCAACG 700 

TCACCACTGA AGTCAAGTCC GTCGAGATGC ACCACGAGCA GCTCTYTGAG 750 

GGTCTTCCCG GTGACAACGT CGGCTTCAAC GTKAAGAACG TYTCCGTCAA 800 

GGAGATCCGA CGTGGCAACG TCGCTGGTGA CTCCAAGAAC GACCCCCCTY 850 

TGGGTGCCGC CTCTTTCACC GCCCAGGTCA TTGTCCTCAA CCACCCTGGC 900 

3 0 CAGGTCGGTG CCGGTTACGC CCCCGTTYTG GACTGCCACA CTGCCCACAT 950 

TGCCTGCAAG TTCGCCGAGA TCCAGGAGAA GATCGACCGC CGAACTGGTA 1000 

AGGCTGTTGA GTCCGCCCCC AAGTTCATCA AGTCTGGTGA CTCCGCCATC 1050 

GTCAAGATGG TTCCCTCCAA GCCCATGTGC GTTGAGGCTT TCACTGACTA 1100 

CCCCCCT 1107 

35 



2) INFORMATION FOR SEQ ID NO: 1284 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1045 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sporothrix schenckii 
50 (B) STRAIN: ATCC 14285 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1284 



697 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



GCTCAAGGCC GAGCGTGAGC GCGGTATCAC CATCGATATT GCTCTGTGGA 50 

AGTTCGAGAC CCCCAAGTAC TACGTCACCG TCATTGACGC CCCCGGTCAT 100 

CGCGATTTCA TCAAGAACAT GATCACTGGT ACCTCGCAGG CCGACTGCGC 150 

CATTCTCATC ATTGCCGCTG GTACTGGTGA GTTCGAGGCT GGTATCTCCA 200 

5 AGGATGGCCA GACTCGTGAG CACGCTCTGC TCGCCTACAC CCTGGGTGTG 250 

CGGCAGCTGA TCGTCGCCAT CAACAAGATG GACACGGCCA AGTGGGCTGA 300 

GGCTCGTTAC CAGGAGATCA TCAAGGAGAC CTCCAACTTC ATCAAGAAGG 350 

TCGGCTACAA CCCCAAGACT GTTGCCTTCG TCCCCATCTC GGGCTTCCAC 400 

GGCGACAACA TGCTTACTCC CTCGACCAAC TGCCCCTGGT ACAAGGGCTG 4 50 

10 GGAGAAGGAG GGCAAGAGCG GCAAGGTTAC CGGTAAGACT CTGCTGGACG 500 

CCATTGACGC CGTCGAGCCC CCCAAGCGCC CCACGGACAA GCCCCTGCGT 550 

CTGCCCCTCC AGGATGTCTA CAAGATCGGC GGTATCGGCA CTGTCCCTGT 600 

CGGCCGTATC GAGACTGGTG TCCTGAAGCC CGGCATGGTC GTCACCTTTG 650 

CCCCGTCCAA CGTCACCACT GAAGTCAAGT CCGTCGAGAT GCACCACGAG 700 

15 CAGCTTGTTG AGGGTGTTCC CGGCGACAAC GTCGGCTTCA ACGTCAAGAA 750 

CGTYTCCGTC AAGGAGATCC GTYGTGGCAA CGTTGCCGGT GACTCCAAGA 800 

ACGACCCCCC CTCGGGCGCC GCCACCTTCA ACGCCCAGGT CATTGTCCTG 850 

AACCACCCCG GCCAGGTCGG CAACGGCTAC GCCCCGGTTY TGGACTGCCA 900 

CACCGCCCAC ATTGCCTGCA AGTTCACCGA GATCCTTGAG AAGATCGACC 950 

2 0 GCCGTACCGG CAAGTCGGTT GAGAACAACC CCAAGTTCAT CAAGTCGGGT 1000 

GACGCCGCCA TTGTCAAGCT GACGCCCTYG AAGCCCATGT GCGTT 1045 



25 2) INFORMATION FOR SEQ ID NO: 1285 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 764 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus nidulans 

(B) STRAIN: WSA-176 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1285 



TTGCCAGTGG TCCGAGGCCC GTTACAACGA AATCGTCAAG GAGACTTCCG 50 

GTTTCATCAA GAAGGTCGGA TACAACCCCA AGTCCGTTGC CTTCGTCCCC 100 

ATCTCCGGTT TCAACGGTGA CAACATGCTC GAGGCCTCTA CCAACTGCCC 150 

CTGGTACAAG GGTTGGGAGA AGGAGACCAA GGCCGGTAAG GCCACTGGTA 200 

45 AGACCCTCCT TGAGGCCATC GACGCCATTG AGCCCCCCAC CCGTCCCTCC 250 

AACAAGCCCC TCCGTCTTCC CCTCCAGGAT GTCTACAAGA TCTCCGGTAT 300 

TGGAACTGTC CCCGTCGGCC GTGTCGAGAC TGGTGTTATC ACCCCCGGCA 350 

TGGTCGTCAC CTTCGCTCCT GCCAACGTCA CCACTGAAGT CAAGTCCGTT 400 

GAGATGCACC ACCAGCAGCT CAAGGAGGGT GTCCCCGGTG ACAACGTCGG 4 50 

50 TTTCAACGTC AAGAACGTTT CCGTCAAGGA AATCCGTCGT GGTAACGTTG 500 

CCTCCGACTC CAAGAACGAC CCCGCCTCCG GCGCTGCCTC TTTCAACGCC 550 

CAGGTCATCG TTCTCAACCA CCCCGGTCAG GTCGGTGCTG GTTACGCCCC 600 

CGTCCTCGAC TGCCACACCG CCCACATTGC TTGCAAGTTC TCTGAGCTTC 650 

698 
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TTGAGAAGAT TGACCGCCGT ACCGGAAAGG CTGTTGAAAC CAGCCCCAAG 
TTCATCAAGT CCGGTGACGC TGCCATCGTC AAGATGATTC CTTCCAAGCC 
CATGTGCGKT CCGA 



700 
750 
764 



2) INFORMATION FOR SEQ ID NO: 1286 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 971 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cladophialophora carrionii 

(B) STRAIN: ATCC 16264 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1286 

ACCATCGATA TCGCGCTCTG GAAGTTCGAG ACTCCCAAGT ACTTCGTCAC 50 

CGTCATCGAT GCCCCTGGTC ATCGTGACTT CATCAAGAAC ATGATCACTG 100 

GTACCTCCCA GGCTGATTGT GCTATTCTCA TCATTGCCGC TGGTACTGGT 150 

GAGTTCGAGG CCGGTATCTC CAAGGATGGC CAGACCCGTG AGCATGCTCT 200 

GCTCGCCTAC ACCCTGGGCG TGAAGCAGCT TATCGTCGCC ATCAACAAGA 250 

TGGACACCAC CAAATGGTCT GAGGATCGTT TCAACGAAAT CATCAAGGAG 300 

ACTTCCAACT TCATCAAGAA GGTCGGATAC AACCCCAAGT CCGTTCCATT 350 

CGTGCCCATC TCCGGCTTCA ACGGTGACAA CATGATCGAC GTCTCCACCA 4 00 

ATGCCCCCTG GTACAAGGGC TGGGAAAAGG AGTCCAAGGC TGGCAAGGCC 4 50 

ACCGGCAAGA CCCTCCTTGA GGCTATCGAC TCCATCGACC CTCCTGCTCG 500 

TCCCACCGAC AAGCCTCTCC GTCTCCCACT CCAGGATGTC TACAAGATTT 550 

CTGGTATCGG CACGGTGCCC GTCGGTCGTG TTGAGACTGG TACCATCAAG 600 

GCCGGTATGG TCGTCACCTT TGCCCCCGCC AACGTCACCA CTGAAGTCAA 650 

GTCCGTCGAA ATGCACCACG AACAGCTYGC CGAGGGCGTT CCGGGTGACA 7 00 

ACGTCGGCTT CAACGTCAAG AACGTYTCCG TGAAGGAGGT TCGTCGTGGA 7 50 

AACGTTGCTG GTGACTCCAA GAACGACCCC CCCAAGGGTG CCGACTCCTT 800 

CAACGCCCAG GTCATCGTCC TCAACCACCC TGGTCAGGTC GGTGCTGGCT 850 

ACGCCCCGGT CTTGGATTGC CACACTGCCC ACATTGCCTG CAAGTTCTYT 900 

GAGCTCCTCG AGAAGATCGA TCGTCGKACC GGCAAGTCCA TGGAAAACAA 950 

CCCCAAGTTC ATCAAGTCTG G 971 



2) INFORMATION FOR SEQ ID NO: 1287 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Exserohilum rostratum 

(B) STRAIN: WSA-215 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1287 

GGTCTGAGGA CCGTTACCAG GAG AT CATC A AGGAGACCTC CAACTTCATC 50 

AAGAAGGTCG GCTACAACCC CAAGCACGTT CCCTTCGTCC CCATCTCCGG 100 

TTTCAACGGA GACAACATGA TCGAGGCCTC CAGCAACTGC CCCTGGTACA 150 

AGGGTTGGGA GAAGGAGACC AAGGCCAAGG CCACTGGTAA GACCCTCCTT 200 

GAGGCCATTG ACGCCATCGA CCCTCCCAGC CGTCCTACCG ACAAGCCCCT 250 

CCGTCTTCCC CTCCAGGATG TCTACAAGAT TGGTGGTATT GGCACGGTTC 300 

CCGTCGGTCG TGTCGAGACC GGTATCATCA AGGCCGGTAT GGTCGTCACC 350 

TTCGCCCCCG CTGGTGTCAC CACTGAAGTC AAGTCCGTCG AGATGCACCA 4 00 

CGAGCAGCTT ACCGAGGGTG TCCCCGGTGA CAACGTCGGC TTCAACGTCA 450 

AGAACGTCTC CGTCAAGGAG ATCCGTCGTG GTAACGTTGC CGGTGACTCC 500 

AAGAACGACC CCCCCAAGGG CTGCGAGTCT TTCAACGCTC AGGTCATTGT 550 

CCTCAACCAC CCTGGTCAGG TCGGTGCCGG TTACGCGCCA GTCCTCGACT 600 

GCCACACCGC CCACATTGCC TGCAAGTTCT CTGAGCTCCT CGAGAAGATT 650 

GACCGCCGTA CCGGAAAGTC TGTCGAAGCC TCTCCCAAGT TCATCAAGTC 700 

TGGTGACGCG GCCATCGTCA AGATGGTTCC CT 7 32 



2) INFORMATION FOR SEQ ID NO: 1288 

(i) SEQUENCE CHARACTERISTICS: 

(A) ' LENGTH: 337 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1236 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1288 

AATGGATCCT GTATACGCAC AAAAATTAGG CGTTAACATA GATGAATTRC 50 

TATTATCACA GCCTGATACA GGGGAGCAAG GATTGGAAAT CGCGGAAGCA 100 

CTTGTACGAA GTGGTGCGGT TGACATTATC GTAATTGACT CTGTAGCAGC 150 

TCTTGTACCG AAAGCAGAGA TTGAAGGCGA TATGGGTGAC TCACACGTAG 200 

GTTTACAAGC ACGTTTAATG TCACAAGCAC TTMGTAAGCT TTCAGGAGCA 250 

ATCAACAAAT CARRARCAAT WGCAATCTTT ATTAACCAAA TTCGWGAAAA 300 

AGTTGGGGTT ATGTTCGGAA ACCCAGAAAC AACTCCA 337 



2) INFORMATION FOR SEQ ID NO: 1289 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 336 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1232 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1289 

AATGGATCCT GTATATGCAC AAAAATTAGG CGTTAACATA GATGAATTRC 50 

TATTATCACA GCCTGATACA GGGGAGCAAG GATTGGAAAT CGCGGAAGCA 100 

CTTGTACGAA GTGGTGCGGT TGACATTATC GTAATTGACT CTGTAGCAGC 150 

TCTTGTACCG AAAGCAGAGA TTGAAGGCGA TATGGGTGAC TCACACGTAG 200 

GTTTACAAGC ACGTTTAATG TCACAAGCAC TTMGTAAGCT TTCAGGAGCA 250 

ATCAACAAAT CARAARCAAT TGCAATCTTT ATTAACCAAA TTCGTGAAAA 300 

AGTTGGGGTT ATGTTCGGAA ACCCAGAAAC AACTCC 336 



2) INFORMATION FOR SEQ ID NO: 1290 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1290 
GAYTAYGCIA TGISIGTIAT HGT 



2) INFORMATION FOR SEQ ID NO: 1291 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1291 



701 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

GCIYTICCIG AYGTIMGIGA YGG 



PCT/CA00/01150 

23 



5 2) INFORMATION FOR SEQ ID NO: 1292 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1292 

ARISCYTCIA RIATRTGIGC 20 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1293 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1293 
ATGGCTGAAT TACCTCAATC 20 



2) INFORMATION FOR SEQ ID NO: 1294 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45. (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1294 
ATGATTGTTG TATATCTTCT TCAAC 25 

50 



2) INFORMATION FOR SEQ ID NO: 1295 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1295 
CAGAAAGTTT GAAGCGTTGT 



2) INFORMATION FOR SEQ ID NO: 1296 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1296 
AACGATTCGT GAGTCAGATA 



2) INFORMATION FOR SEQ ID NO: 1297 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1297 
CGGTCAACAT TGAGGAAGAG CT 



2) INFORMATION FOR SEQ ID NO: 1298 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1298 
ACGAAATCGA CCGTCTCTTT TTC 



2) INFORMATION FOR SEQ ID NO: 1299 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2711 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: 601055 

(C) ACCESSION NUMBER: X71437 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1299 

ATGGCTGAAT TACCTCAATC AAGAATAAAT GAACGAAATA TTACCAGTGA 50 

AATGCGTGAA TCATTTTTAG ATTATGCGAT GAGTGTTATC GTTGCTCGTG 100 

CATTGCCAGA TGTTCGTGAC GGTTTAAAAC CAGTACATCG TCGTATACTA 150 

TATGGATTAA ATGAACAAGG TATGACACCG GATAAATCAT ATAAAAAATC 200 

AGCACGTATC GTTGGTGACG TAATGGGTAA ATATCACCCT CATGGTGACT 250 

CATCTATTTA TGAAGCAATG GTACGTATGG CTCAAGATTT CAGTTATCGT 300 

TATCCGCTTG TTGATGGCCA AGGTAACTTT GGTTCAATGG ATGGAGATGG 350 

CGCAGCAGCA ATGCGTTATA CTGAAGCGCG TATGACTAAA ATCACACTTG 400 

AACTGTTACG TGATATTAAT AAAGATACAA TAGATTTTAT CGATAACTAT 4 50 

GATGGTAATG AAAGAGAGCC GTCAGTCTTA CCTGCTCGAT TCCCTAATTT 500 

ATTAGCCAAT GGTGCATCAG GTATCGCGGT AGGTATGGCA ACGAATATTC 550 

CACCACATAA CTTAACAGAA TTAATCAATG GTGTACTTAG CTTAAGTAAG 600 

AACCCTGATA TTTCAATTGC TGAGTTAATG GAAGATATTG AAGGTCCTGA 650 

TTTCCCAACT GCTGGACTTA TTTTAGGTAA GAGTGGTATT AGACGTGCAT 700 

ATGAAACAGG TCGTGGTTCA ATTCAAATGC GTTCTCGTGC AGTTATTGAA 7 50 

GAACGTGGAG GCGGACGTCA ACGTATTGTT GTCACTGAAA TTCCTTTCCA 800 

AGTGAATAAG GCTCGTATGA TTGAAAAAAT TGCAGAGCTC GTTCGTGACA 850 

AGAAAATTGA CGGTATCACT GATTTACGTG ATGAAACAAG TTTACGTACT 900 

GGTGTGCGTG TCGTTATTGA TGTGCGTAAG GATGCAAATG CTAGTGTCAT 950 

TTTAAATAAC TTATACAAAC AAACACCTCT TCAAACATCA TTTGGTGTGA 1000 

ATATGATTGC ACTTGTAAAT GGTAGACCGA AGCTTATTAA TTTAAAAGAA 1050 

GCGTTGGTAC ATTATTTAGA GCATCAAAAG ACAGTTGTTA GAAGACGTAC 1100 

GCAATATAAC TTACGTAAAG CTAAAGATCG TGCCCATATT TTAGAAGGGT 1150 

TACGTATCGC ACTTGACCAT ATCGATGAAA TTATTTCAAC GATTCGTGAG 1200 

TCAGATACAG ATAAAGTTGC AATGGAAAGC TTGCAACAAC GCTTCAAACT 1250 

TTCTGAAAAA CAAGCTCAAG CTATTTTAGA CATGCGTTTA AGACGTCTAA 1300 
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CAGGTTTAGA GAGAAACAAA ATTGAAGCTG 
TATATTAGTG AATTAGAAGC CATCTTAGCT 
GTTAGTTAGA GATGAATTGA CTGAAATTAG 
GTCGTACAGA AATTCAATTA GGTGGATTTG 
TTAATTCCAG AAGAACAAAT AGTAATTACT 
TAAACGTTTG CCGGTATCTA CATATCGTGC 
GTGTTCAAGG TATGAATACA TTGGAAGAAG 
ACTTTAAGTA CACATGACCA TGTATTGTTC 
ATACAAACTA AAAGGTTATG AAGTGCCTGA 
GTATTCCTGT AGTGAATGCT ATTGAACTTG 
ACAATGATTG CTGTTAAAGA CCTTGAAAGT 
TGCAACTAAA CGTGGTGTTG TTAAACGTTC 
GAATAAATAG AAATGGTAAG ATTGCGATTT 
TTAATTGCAG TTCGTTTAAC AAGTGGTCAA 
ATCACATGCA TCATTAATTC GATTCCCTGA 
GCCGTACAGC AACGGGTGTG AAAGGTATTA 
GTTGTAGGGC TTGATGTAGC TCATGCAAAC 
AGTTACTGAA AATGGTTATG GTAAACGTAC 
TATCAAATCG TGGTGGTAAA GGTATTAAAA 
AATGGTAATG TTGTATGTAT CACTACAGTA 
GATTGTTACT AATGCCGGTG TCATTATTCG 
CTCAAAATGG TCGTGCAGCA CAAGGTGTTC 
GATCAATTTG TTTCAACGGT TGCTAAAGTA 
AAATGAAGAT GAACAATCTA CTGTATCTGA 
GTGAAGCGGT TGTAAATGAT GAAACACCAG 
GTGATTGATT CAGAAGAAAA TGATGAAGAT 
AGATTTCATG GATCGTGTTG AAGAAGATAT 
ATGAAGAATA A 



PCT/CAOO/01150 

AATATAATGA GTTATTAAAT 1350 

GATGAAGAAG TGTTATTACA 1400 

AGATCGTTTC GGTGATGAGC 1450 

AAGACTTAGA GGACGAAGAC 1500 

TTGAGCCATA ATAACTACAT 1550 

TCAAAACCGT GGTGGTCGTG 1600 

ATTTTGTCAG TCAATTGGTA 1650 

TTTACTAACA AAGGTCGTGT 1700 

GTTATCAAGA CAGTCTAAAG 1750 

GAAATGATGA AGTCATTAGT 1800 

GAAGACAACT TCTTAGTGTT 1850 

AGCATTAAGT AACTTCTCAA 1900 

CGTTCAGAGA AGATGATGAG 1950 

GAAGATATCT TGATTGGTAC 2000 

ATCAACATTA CGTCCTTTAG 2050 

CACTTCGTGA AGGTGACGAA 2100 

AGTGTTGATG AAGTATTAGT 2150 

GCCAGTTAAT GACTATCGTT 2200 

CAGCTACGAT TACTGAGCGT 2250 

ACTGGTGAAG AAGATTTAAT 2300 

ACTAGATGTT GCAGATATTT 2350 

GCTTAATTCG CTTAGGCGAT 2400 

AAAGAGGATG CAGATGAAGT 2 4 50 

AGATGGTACT GAACAACAAC 2500 

GAAATGCAAT TCATACTGAA 2550 

GGACGTATTG AAGTAAGACA 2 600 

ACAACAATCA TCAGATGAAG 2650 

2711 



2) INFORMATION FOR SEQ ID NO: 1300 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2628 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: K12 

(C) ACCESSION NUMBER: X57174 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1300 

ATGAGCGACC TTGCGAGAGA AATTACACCG GTCAACATTG AGGAAGAGCT 50 

GAAGAGCTCC TATCTGGATT ATGCGATGTC GGTCATTGTT GGCCGTGCGC 100 

TGCCAGATGT CCGAGATGGC CTGAAGCCGG TACACCGTCG CGTACTTTAC 150 

GCCATGAACG TACTAGGCAA TGACTGGAAC AAAGCCTATA AAAAATCTGC 200 

CCGTGTCGTT GGTGACGTAA TCGGTAAATA CCATCCCCAT GGTGACTCGG 250 
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CGGTCTATGA CACGATCGTC CGCATGGCGC AGCCATTCTC GCTGCGTTAT 300 

ATGCTGGTAG ACGGTCAGGG TAACTTCGGT TCTATCGACG GCGACTCTGC 350 

GGCGGCAATG CGTTATACGG AAATCCGTCT GGCGAAAATT GCCCATGAAC 400 

TGATGGCCGA TCTCGAAAAA GAGACGGTCG ATTTCGTTGA TAACTATGAC 450 

5 GGCACGGAAA AAATTCCGGA CGTCATGCCA ACCAAAATTC CTAACCTGCT 500 

GGTGAACGGT TCTTCCGGTA TCGCCGTAGG TATGGCAACC AACATCCCGC 550 

CGCACAACCT GACGGAAGTC ATCAACGGTT GTCTGGCGTA TATTGATGAT 600 

GAAGACATCA GCATTGAAGG GCTGATGGAA CACATCCCGG AGCCGGACTT 650 

CCCGACGGCG GCAATCATTA ACGGTCGTCG CGGTATTGAA GAAGCTTACC 700 

10 GTACCGGTCG CGGCAAGGTG TATATCCGCG CTCGCGCAGA AGTGGAAGTT 750 

GACGCCAAAA CCGGTCGTGA AACCATTATC GTCCACGAAA TTCCGTATCA 800 

GGTAAACAAA GCGCGCCTGA TCGAGAAGAT TGCGGAACTG GTAAAAGAAA 850 

AACGCGTGGA AGGCATCAGC GCGCTGCGTG ACGAGTCTGA CAAAGACGGT 900 

ATGCGCATCG TGATTGAAGT GAAACGCGAT GCGGTCGGTG AAGTTGTGCT 950 

15 CAACAACCTC TACTCCCAGA CCCAGTTGCA GGTTTCTTTC GGTATCAACA 1000 

TGGTGGCATT GCACCATGGT CAGCCGAAGA TCATGAACCT GAAAGACATC 1050 

ATCGCGGCGT TTGTTCGTCA CCGCCGTGAA GTGGTGACCC GTCGTACTAT 1100 

TTTCGAACTG CGTAAAGCTC GCGATCGTGC TCATATCCTT GAAGCATTAG 1150 

CCGTGGCGCT GGCGAACATC GACCCGATCA TCGAACTGAT CCGTCATGCG 1200 

20 CCGACGCCTG CAGAAGCGAA AACTGCGCTG GTTGCTAATC CGTGGCAGCT 1250 

GGGCAACGTT GCCGCGATGC TCGAACGTGC TGGCGACGAT GCTGCGCGTC 1300 

CGGAATGGCT GGAGCCAGAG TTCGGCGTGC GTGATGGTCT GTACTACCTG 1350 

ACCGAACAGC AAGCTCAGGC GATTCTGGAT CTGCGTTTGC AGAAACTGAC 1400 

CGGTCTTGAG CACGAAAAAC TGCTCGACGA ATACAAAGAG CTGCTGGATC 1450 

25 AGATCGCGGA ACTGTTGCGT ATTCTTGGTA GCGCCGATCG TCTGATGGAA 1500 

GTGATCCGTG AAGAGCTGGA GCTGGTTCGT GAACAGTTCG GTGACAAACG 1550 

TCGTACTGAA ATCACCGCCA ACAGCGCAGA CATCAACCTG GAAGATCTGA 1600 

TCACCCAGGA AGATGTGGTC GTGACGCTCT CTCACCAGGG CTACGTTAAG 1650 

TATCAGCCGC TTTCTGAATA CGAAGCGCAG CGTCGTGGCG GGAAAGGTAA 1700 

30 ATCTGCCGCA CGTATTAAAG AAGAAGACTT TATCGACCGA CTGCTGGTGG 1750 

CGAACACTCA CGACCATATT CTGTGCTTCT CCAGCCGTGG TCGCGTCTAT 1800 

TCGATGAAAG TTTATCAGTT GCCGGAAGCC ACTCGTGGCG CGCGCGGTCG 1850 

TCCGATCGTC AACCTGCTGC CGCTGGAGCA GGACGAACGT ATCACTGCGA 1900 

TCCTGCCAGT GACCGAGTTT GAAGAAGGCG TGAAAGTCTT CATGGCGACC 1950 

35 GCTAACGGTA CCGTGAAGAA AACTGTCCTC ACCGAGTTCA ACCGTCTGCG 2000 

TACCGCCGGT AAAGTGGCGA TCAAACTGGT TGACGGCGAT GAGCTGATCG 2050 

GCGTTGACCT GACCAGCGGC GAAGACGAAG TAATGCTGTT CTCCGCTGAA 2100 

GGTAAAGTGG TGCGCTTTAA AGAGTCTTCT GTCCGTGCGA TGGGCTGCAA 2150 

CACCACCGGT GTTCGCGGTA TTCGCTTAGG TGAAGGCGAT AAAGTCGTCT 2200 

40 CTCTGATCGT GCCTCGTGGC GATGGCGCAA TCCTCACCGC AACGCAAAAC 2250 

GGTTACGGTA AACGTACCGC AGTGGCGGAA TACCCAACCA AGTCGCGTGC 2300 

GACGAAAGGG GTTATCTCCA TCAAGGTTAC CGAACGTAAC GGTTTAGTTG 2350 

TTGGCGCGGT ACAGGTAGAT GACTGCGACC AGATCATGAT GATCACCGAT 2400 

GCCGGTACGC TGGTACGTAC TCGCGTTTCG GAAATCAGCA TCGTGGGCCG 2450 

45 TAACACCCAG GGCGTGATCC TCATCCGTAC TGCGGAAGAT GAAAACGTAG 2500 

TGGGTCTGCA ACGTGTTGCT GAACCGGTTG ACGAGGAAGA TCTGGATACC 2550 

ATCGACGGCA GTGCCGCGGA AGGGGACGAT GAAATCGCTC CGGAAGTGGA 2600 

CGTTGACGAC GAGCCAGAAG AAGAATAA 2628 

50 

2) INFORMATION FOR SEQ ID NO: 1301 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1301 
GTIMGIAWIM GICCIGSIAT GTA 



2) INFORMATION FOR SEQ ID NO: 1302 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1302 
TAIADIGGIG GIKKIGCIAT RTA 



2) INFORMATION FOR SEQ ID NO: 1303 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1303 
GGIGAIGAID YIMGIGARGG 



2) INFORMATION FOR SEQ ID NO: 1304 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1304 
CIARYTTIKY ITTIGTYTG 



2) INFORMATION FOR SEQ ID NO: 1305 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1305 
ATGGTGACTG CATTGTCAGA TG 



2) INFORMATION FOR SEQ ID NO: 1306 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1306 
GTCTACGGTT TTCTACAACG TC 



2) INFORMATION FOR SEQ ID NO: 1307 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1923 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1307 

5 ATGGTGACTG CATTGTCAGA TGTAAACAAC ACGGATAATT ATGGTGCTGG 50 

GCAAATACAA GTATTAGAAG GTTTAGAAGC AGTACGTAAA AGACCAGGTA 100 

TGTATATAGG ATCGACTCAG AGAGAGTTGC ACATTAGTGT GGAAATTGTC 150 

GATAATAGTA TCGATGAAGC ATTAGCTGGT TATGCAAATA AAATTGAAGT 200 

TGTTATTGAA AAAGATAACT GGATTAAAGT AACGGATAAC GGACGTGGTA 250 

10 TCCCAGTTGA TATTCAAGAA AAAATGGGAC GTCCAGCTGT CGAAGTTATT 300 

TTAACTGTTT TACATGCTGG TGGTAAATTC GGCGGTGGCG GATACAAAGT 350 

ATCTGGTGGT TTACATGGTG TTGGTTCATC AGTTGTAAAC GCATTGTCAC 4 00 

AAGACTTAGA AGTATATGTA CACAGAAATG AGACTATATA TCATCAAGCA 4 50 

TATAAAAAAG GTGTACCTCA ATTTGACTTA AAAGAAGTTG GCACAACTGA 500 

15 TAAGACAGGT ACTGTCATTC GTTTTAAAGC AGATGGAGAA ATCTTCACAG 550 

AGACAACTGT ATACAACTAT GAAACATTAC AGCAGCGTAT TAGAGAGCTT 600 

GCTTTCTTAA ACAAAGGAAT TCAAATCACA TTAAGAGATG AACGTGATGA 650 

AGAAAACGTT AGAGAAGACT CCTATCACTA TGAGGGCGGT ATTAAATCGT 700 

ACGTTGAGTT ATTGAACGAA AATAAAGAAC CTATTCATGA TGAGCCAATT 750 

20 TATATTCATC AATCTAAAGA TGATATTGAA GTAGAAATTG CGATTCAATA 800 

TAACTCAGGA TATGCCACAA ATCTTTTAAC TTACGCAAAT AACATTCATA 850 

CGTACGAAGG TGGTACGCAT GAAGACGGAT TCAAACGTGC ATTAACGCGT 900 

GTCTTAAATA GTTATGGTTT AAGTAGCAGA TATGAAGAAG AAAAGATAGC 950 

TTCTGGTGAA GATACACGAG AAGGTATGAC AGCAATTATA TCTATCAAAC 1000 

25 ATGGTGATCC TCAATTCGAA GGTCAAACGA AGACAAAATT AGGTAATTCT 1050 

GAAGTGCGTC AAGTTGTAGA TAAATTATTC TCAGAGCACT TTGAACGATT 1100 

TTTATATGAA AATCCACAAG TCGCACGTAC AGTGGTTGAA AAAGGTATTA 1150 

TGGCGGCACG TGCACGTGTT GCTGCGAAAA AAGCGCGTGA AGTAACACGT 1200 

CGTAAATCAG CGTTAGATGT AGCAAGTCTT CCAGGTAAAT TAGCCGATTG 1250 

30 CTCTAGTCAA AGTCCTGAAG AATGTGAGAT TTTCTTAGTC GAAGGGGACT 1300 

CTGCCGGAGG GTCTACAAAA TCTGGTCGTG ACTCTAGAAC GCAGGCGATT 1350 

TTACCATTAC GAGGTAAGAT ATTAAATGTT GAAAAAGCAC GATTAGATAG 14 00 

AATTTTGAAT AACAATGAAA TTCGTCAAAT GATCACAGCA TTTGGTACAG 14 50 

GAATCGGTGG CGACTTTGAT CTAGCGAAAG CAAGATATCA CAAAATCGTC 1500 

35 ATTATGACTG ATGCCGATGT GGATGGAGCG CATATTAGAA CATTGTTATT 1550 

AATATTCTTC TATCGATTTA TGAGACCGTT AATTGAAGCA GGCTATGTGT 1600 

ATATTGCACA GCCACCGTTG TATAAACTGA CACAAGGTAA ACAAAAGTAT 1650 

TATGTATACA ATGATAGGGA ACTTGATAAA CTTAAATCTG AATTGAATCC 1700 

AACACCAAAA TGGTCTATTG CGCTATACAA AGGTCTTGGA GAAATGAATG 1750 

40 CAGATCAATT ATGGGAAACA ACAATGAACC CTGAGCACCG CGCTCTTTTA 1800 

CAAGTAAAAC TTGAAGATGC GATTGAAGCG GACCAAACAT TTGAAATGTT 1850 

AATGGGTGAC GTTGTAGAAA ACCGTAGACA ATTTATAGAA GATAATGCAG 1900 

TTTATGCAAA CTTAGACTTC TAA 1923 



45 



2) INFORMATION FOR SEQ ID NO: 1308 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1308 
ATGTAYGTIA TIATGGAYMG IGC 



2) INFORMATION FOR SEQ ID NO: 1309 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1309 
ATIATYTTRT TICCYTTICC YTT 



2) INFORMATION FOR SEQ ID NO: 1310 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1310 
ATIATITSIA TIACYTCRTC 



2) INFORMATION FOR SEQ ID NO: 1311 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1311 
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2) INFORMATION FOR SEQ ID NO: 1312 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1312 
AARTAYATIA TICARGARMG IGC 



2) INFORMATION FOR SEQ ID NO: 1313 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1313 
AMIAYICKRT GIGGITTITT YTT 



2) INFORMATION FOR SEQ ID NO: 1314 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1314 
TAIGAITTYA CIGAISMICA RGC 



2) INFORMATION FOR SEQ ID NO: 1315 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1315 
ACIATIGCIT CIGCYTGIKS YTC 



2) INFORMATION FOR SEQ ID NO: 1316 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: . Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1316 
GTGAGTGAAA TAATTCAAGA TT 



2) INFORMATION FOR SEQ ID NO: 1317 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1317 
CACCAAAATC ATCTGTATCT AC 



2) INFORMATION FOR SEQ ID NO: 1318 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1318 
ACCTAYTCSA TGTACGTRAT CATGGA 



2) INFORMATION FOR SEQ ID NO: 1319 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1319 
AGRTCGTCIA CCATCGGYAG YTT 



2) INFORMATION FOR SEQ ID NO: 1320 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2259 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: K-12 MG1655 

(C) ACCESSION NUMBER: AE000384 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1320 

ATGAGCGATA TGGCAGAGCG CCTTGCGCTA CATGAATTTA CGGAAAACGC 50 

CTACTTAAAC TACTCCATGT ACGTGATCAT GGACCGTGCG TTGCCGTTTA 100 

TTGGTGATGG TCTGAAACCT GTTCAGCGCC GCATTGTGTA TGCGATGTCT 150 

GAACTGGGCC TGAATGCCAG CGCCAAATTT AAAAAATCGG CCCGTACCGT 200 

CGGTGACGTA CTGGGTAAAT ACCATCCGCA CGGCGATAGC GCCTGTTATG 250 

AAGCGATGGT CCTGATGGCG CAACCGTTCT CTTACCGTTA TCCGCTGGTT 300 

GATGGTCAGG GGAACTGGGG CGCGCCGGAC GATCCGAAAT CGTTCGCGGC 350 

AATGCGTTAC ACCGAATCCC GGTTGTCGAA ATATTCCGAG CTGCTATTGA 400 

GCGAGCTGGG GCAGGGGACG GCTGACTGGG TGCCAAACTT CGACGGCACT 450 

TTGCAGGAGC CGAAAATGCT ACCTGCCCGT CTGCCAAACA TTTTGCTTAA 500 

713 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CAOO/01150 

CGGCACCACC GGTATTGCCG TCGGCATGGC GACCGATATT CCACCGCATA 550 

ACCTGCGTGA AGTGGCTCAG GCGGCAATCG CATTAATCGA CCAGCCGAAA 600 

ACCACGCTCG ATCAGCTGCT GGATATCGTG CAGGGGCCGG ATTATCCGAC 650 

TGAAGCGGAA ATTATCACTT CGCGCGCCGA GATCCGTAAA ATCTACGAGA 700 

5 ACGGACGTGG TTCAGTGCGT ATGCGCGCGG TGTGGAAGAA AGAAGATGGC 750 

GCGGTGGTTA TCAGCGCATT GCCGCATCAG GTTTCAGGTG CGCGCGTACT 800 

GGAGCAAATT GCTGCGCAAA TGCGCAACAA AAAGCTGCCG ATGGTTGACG 850 

ATCTGCGCGA TGAATCTGAC CACGAGAACC CGACCCGCCT GGTGATTGTG 900 

CCGCGTTCCA ACCGCGTGGA TATGGATCAG GTGATGAACC ACCTCTTCGC 950 

10 TACCACCGAT CTGGAAAAGA GCTATCGTAT TAACCTTAAT ATGATCGGTC 1000 

TGGATGGTCG TCCGGC.GGTG AAAAACCTGC TGGAAATCCT CTCCGAATGG 1050 

CTGGTGTTCC GCCGCGATAC CGTGCGCCGC CGACTGAACT ATCGTCTGGA 1100 

GAAAGTCCTC AAGCGCCTGC ATATCCTCGA AGGTTTGCTG GTGGCGTTTC 1150 

TCAATATCGA CGAAGTGATT GAGATCATTC GTAATGAAGA TGAACCGAAA 1200 

15 CCGGCGCTGA TGTCGCGGTT TGGCCTTACG GAAACCCAGG CGGAAGCGAT 1250 

CCTCGAACTG AAACTGCGTC ATCTTGCCAA ACTGGAAGAG ATGAAGATTC 1300 

GCGGTGAGCA GAGTGAACTG GAAAAAGAGC GCGACCAGTT GCAGGGCATT 1350 

TTGGCTTCCG AGCGTAAAAT GAATAACCTG CTGAAGAAAG AACTGCAGGC 1400 

AGACGCGCAA GCCTACGGTG ACGATCGTCG TTCGCCGTTG CAGGAACGCG 1450 

20 AAGAAGCGAA AGCGATGAGC GAGCACGACA TGCTGCCGTC TGAACCTGTC 1500 

ACCATTGTGC TGTCGCAGAT GGGCTGGGTA CGCAGCGCTA AAGGCCATGA 1550 

TATCGACGCG CCGGGCCTGA ATTATAAAGC GGGTGATAGC TTCAAAGCGG 1600 

CGGTGAAAGG TAAGAGCAAC CAACCGGTAG TGTTTGTTGA TTCCACCGGT 1650 

CGTAGCTATG CCATTGACCC GATTACGCTG CCGTCGGCGC GTGGTCAGGG 1700 

25 CGAGCCGCTC ACCGGCAAAT TAACGTTGCC GCCTGGGGCG ACCGTTGACC 1750 

ATATGCTGAT GGAAAGCGAC GATCAGAAAC TGCTGATGGC TTCCGATGCG 1800 

GGTTACGGTT TCGTCTGCAC CTTTAACGAT CTGGTGGCGC GTAACCGTGC 1850 

AGGTAAGGCT TTGATCACCT TACCGGAAAA TGCCCATGTT ATGCCGCCGG 1900 

TGGTGATTGA AGATGCTTCC GATATGCTGC TGGCAATCAC TCAGGCAGGC 1950 

30 CGTATGTTGA TGTTCCCGGT AAGTGATCTG CCGCAGCTGT CGAAGGGCAA 2000 

AGGCAACAAG ATTATCAACA TTCCATCGGC AGAAGCCGCG CGTGGAGAAG 2050 

ATGGTCTGGC GCAATTGTAC GTTCTGCCGC CGCAAAGCAC GCTGACCATT 2100 

CATGTTGGGA AACGCAAAAT TAAACTGCGC CCGGAAGAGT TACAGAAAGT 2150 

CACTGGCGAA CGTGGACGCC GCGGTACGTT GATGCGCGGT TTGCAGCGTA 2200 

35 TCGATCGTGT TGAGATCGAC TCTCCTCGCC GTGCCAGCAG CGGTGATAGC 2250 

GAAGAGTAA 2259 



40 2) INFORMATION FOR SEQ ID NO: 1321 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2403 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: KMP9 

(C) ACCESSION NUMBER: D67074 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1321 

GTGAGTGAAA TAATTCAAGA TTTATCACTT GAAGATGTTT TAGGTGATCG 50 

5 CTTTGGAAGA TATAGTAAAT ATATTATTCA AGAGCGTGCA TTGCCAGATG 100 

TTCGTGATGG TTTAAAACCA GTACAACGTC GTATTTTATA TGCAATGTAT 150 

TCAAGTGGTA ATACACACGA TAAAAATTTC CGTAAAAGTG CGAAAACAGT 200 

CGGTGATGTT ATTGGTCAAT ATCATCCACA TGGAGACTTC TCAGTGTACA 250 

AAGCAATGGT CCGTTTAAGT CAAGACTGGA AGTTACGACA TGTCTTAATA 300 

10 GAAATGCATG GTAATAATGG TAGTATCGAT AATGATCCGC CAGCGGCAAT 350 

GCGTTACACT GAAGCTAAGT TAAGCTTACT AGCTGAAGAG TTATTACGTG 400 

ATATTAATAA AGAGACAGTT TCTTTCATTC CAAACTATGA TGATACGACA 450 

CTCGAACCAA TGGTATTGCC ATCAAGATTT CCTAACTTAC TAGTGAATGG 500 

TTCTACAGGT ATATCTGCAG GTTACGCGAC AGATATACCA CCACATAATT 55Q 

15 TAGCTGAAGT GATTCAAGCA ACACTTAAAT ATATTGATAA TCCGGATATT 600 

ACAGTCAATC AATTAATGAA AT AT ATT AAA GGTCCTGATT TTCCAACTGG 650 

TGGTATTATT CAAGGTATTG ATGGTATTAA AAAAGCTTAT GAATCAGGTA 700 

AAGGTAGAAT TATAGTTCGT TCTAAAGTTG AAGAAGAAAC TTTACGCAAT 750 

GGACGTAAAC AGTTAATTAT TACTGAAATT CCATATGAAG TGAACAAAAG 800 

2 0 TAGCTTAGTA AAACGTATCG ATGAATTACG TGCTGACAAA AAAGTCGATG 850 

GTATCGTTGA AGTACGTGAT GAAACTGATA GAACTGGTTT ACGAATAGCA 900 

ATTGAATTGA AAAAAGATGT GAACAGTGAA TCAATCAAAA ATTATCTTTA 950 

TAAAAACTCT GATTTACAGA TTTCATATAA TTTCAACATG GTCGCTATTA 1000 

GTGATGGTCG TCCAAAATTG ATGGGTATTC GTCAAATTAT AGATAGTTAT 1050 

25 TTGAATCATC AAATTGAGGT TGTTGCAAAT AGAACGAAGT TTGAATTAGA 1100 

TAATGCTGAA AAACGTATGC ATATCGTTGA AGGTTTGATT AAAGCGTTGT 1150 

CAATTTTAGA TAAAGTAATT GAATTGATTC GTAGCTCTAA AAACAAGCGT 1200 

GACGCTAAAG AAAACCTTAT CGAAGTATTC GAGTTCACAG AAGAACAGGC 1250 

TGAAGCAATT GTAATGTTAC AGTTATATCG TTTAACAAAC ACTGACATAG 1300 

3 0 TTGCGCTTGA AGGTGAACAT AAAGAACTTG AAGCATTAAT CAAACAATTA 1350 

CGTCATATTC TTGATAACCA TGATGCATTA TTGAATGTCA TAAAAGAAGA 1400 

ATTGAATGAA ATTAAAAAGA AATTCAAATC TGAACGACTG TCTTTAATTG 14 50 

AAGCAGAAAT TGAAGAAATT AAAATTGACA AAGAAGTTAT GGTGCCTAGT 1500 

GAAGAAGTTA TTTTAAGTAT GACACGTCAT GG AT AT ATT A AACGTACTTC 1550 

35 TATTCGTAGC TTTAATGCTA GCGGTGTTGA GGATATTGGT TTAAAAGATG 1600 

GTGACAGTTT ACTTAAACAT CAAGAAGTAA ATACGCAAGA TACCGTACTA 1650 

GTATTTACAA ATAAAGGTCG TTATCTATTT ATACCGGTTC ATAAATTAGC 17 00 

AGATATTCGT TGGAAAGAAT TGGGGCAACA TGTATCACAA ATAGTTCCTA 17 50 

TCGAAGAAGA TGAAGTGGTT ATTAATGTCT TTAATGAAAA GGACTTTAAT 1800 

40 ACAGATGCAT TTTATGTTTT TGCGACTCAA AATGGCATGA TTAAGAAAAG 1850 

TACAGTGCCT CTATTTAAAA CAACGCGTTT TAATAAACCT TTAATTGCTA 1900 

CTAAAGTTAA AGAAAATGAT GATTTGATTA GTGTTATGCG CTTTGAAAAA 1950 

GATCAATTAA TTACCGTCAT TACTAATAAA GGTATGTCAT TAACGTATAA 2000 

TACAAGTGAA CTATCAGATA CCGGATTAAG GGCAGCTGGT GTTAAATCAA 2050 

45 TAAATCTTAA AGCTGAAGAT TTCGTTGTTA TGACAGAAGG TGTTTCTGAA 2100 

AATGATACTA TATTGATGGC CACACAACGC GGCTCGTTAA AACGTATTAG 2150 

TTTTAAAATC TTACAAGTTG CTAAAAGAGC ACAACGTGGA ATAACTTTAT 2200 

TAAAAGAATT AAAGAAAAAT CCACATCGTA TTGTAGCTGC ACATGTAGTG 22 50 

ACAGGTGAAC ATAGTCAATA TACATTATAT TCAAAATCAA ATGAAGAACA 2300 

50 TGGTTTAATT AATGATATTC ATAAATCTGA ACAATATACA AATGGCTCAT 2350 

TCATTGTAGA TACAGATGAT TTTGGTGAAG TAATAGACAT GTATATTAGC 24 00 

TAA 2403 
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2) INFORMATION FOR SEQ ID NO: 1322 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1322 
RTIGAIAAYI SIGTIGAYGA RG 



2) INFORMATION FOR SEQ ID NO: 1323 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1323 
ACIAWRSAIG GIGGIACICA YG 



2) INFORMATION FOR SEQ ID NO: 1324 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1324 
CCICCIGCIS WRTCICCYTC 



2) INFORMATION FOR SEQ ID NO: 1325 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1325 
RTTCATYTCI CCIARICCYT T 



2) INFORMATION FOR SEQ ID NO: 1326 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1326 
TGATTCAATA CAGGTTTTAG AG 



2) INFORMATION FOR SEQ ID NO: 1327 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1327 
CTAGATTTCC TCCTCATCAA AT 



2) INFORMATION FOR SEQ ID NO: 1328 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1992 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: RN4220 

(C) ACCESSION NUMBER: D67075 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1328 

ATGAATAAAC AAAATAATTA TTCAGATGAT TCAATACAGG TTTTAGAGGG 50 

GTTAGAAGCA GTTCGTAAAA GACCTGGTAT GTATATTGGA TCAACTGATA 100 

10 AACGGGGATT ACATCATCTA GTATATGAAA TTGTCGATAA CTCCGTCGAT 150 

GAAGTATTGA ATGGTTACGG TAACGAAATA GATGTAACAA TTAATAAAGA 200 

TGGTAGTATT TCTATAGAAG ATAATGGACG TGGTATGCCA ACAGGTATAC 250 

ATAAATCAGG TAAACCGACA GTCGAAGTTA TCTTTACTGT TTTACATGCA 300 

GGAGGTAAAT TTGGACAAGG CGGCTATAAA ACTTCAGGTG GTCTTCACGG 350 

15 TGTTGGTGCT TCAGTTGTAA ATGCATTGAG TGAATGGCTT GAAGTTGAAA 4 00 

TCCATCGAGA TGGTAATATA TATCATCAAA GTTTTAAAAA CGGTGGTTCG 450 

CCATCTTCTG GTTTAGTGAA AAAAGGTAAA ACTAAGAAAA CAGGTACCAA 500 

AGTAACATTT AAACCTGATG ACACAATTTT TAAAGCATCT ACATCATTTA 550 

ATTTTGATGT TTTAAGTGAA CGACTACAAG AGTCTGCGTT CTTATTGAAA 600 

20 AATTTAAAAA TAACGCTTAA TGATTTACGC AGTGGTAAAG AGCGTCAAGA 650 

GCATTACCAT TATGAAGAAG GAATCAAAGA GTTTGTTAGT TATGTCAATG 700 

AAGGAAAAGA AGTTTTGCAT GACGTGGCTA CATTTTCAGG TGAAGCAAAT 750 

GGTATAGAGG TAGACGTAGC TTTCCAATAT AATGATCAAT ATTCAGAAAG 800 

TATTTTAAGT TTTGTAAATA ATGTACGTAC TAAAGATGGT GGTACACATG 850 

25 AAGTTGGTTT TAAAACAGCA ATGACACGTG TATTTAATGA TTATGCACGT 900 

CGTATTAATG AACTTAAAAC AAAAGATAAA AACTTAGATG GTAATGATAT 950 

TCGTGAAGGT TTAACAGCTG TTGTGTCTGT TCGTATTCCA GAAGAATTAT 1000 

TGCAATTTGA AGGACAAACG AAATCTAAAT TGGGTACTTC TGAAGCTAGA 1050 

AGTGCTGTTG ATTCAGTTGT TGCAGACAAA TTGCCATTCT ATTTAGAAGA 1100 

3 0 AAAAGGACAA TTGTCTAAAT CACTTGTGAA AAAAGCGATT AAAGCACAAC 1150 

AAGCAAGGGA AGCTGCACGT AAAGCTCGTG AAGATGCTCG TTCAGGTAAG 1200 

AAAAACAAGC GTAAAGACAC TTTGCTATCT GGTAAATTAA CACCTGCACA 1250 

AAGTAAAAAC ACTGAAAAAA ATGAATTGTA TTTAGTCGAA GGTGATTCTG 1300 

CGGGAGGTTC AGCAAAACTT GGACGAGACC GCAAATTCCA AGCGATATTA 1350 

35- CCATTACGTG GTAAGGTAAT TAATACAGAG AAAGCACGTC TAGAAGATAT 14 00 

TTTTAAAAAT GAAGAAATTA ATACAATTAT CCACACAATC GGGGCAGGCG 14 50 

TTGGTACTGA CTTTAAAATT GAAGATAGTA ATTATAATCG TGTAATTATT 1500 

ATGACTGATG CTGATACTGA TGGTGCGCAT ATTCAAGTGC TATTGTTAAC 1550 

ATTCTTCTTC AAATATATGA AACCGCTTGT TCAAGCAGGT CGTGTATTTA 1600 

4 0 TTGCTTTACC TCCACTTTAT AAATTGGAAA AAGGTAAAGG CAAAACAAAG 1650 

CGAGTTGAAT ACGCTTGGAC AGACGAAGAG CTTAATAAAT TGCAAAAAGA 1700 

ACTTGGTAAA GGCTTCACGT TACAACGTTA CAAAGGTTTG GGTGAAATGA 1750 

ACCCTGAGCA ATTATGGGAA ACGACGATGA ACCCAGAAAC ACGAACTTTA 1800 

ATTCGTGTAC AAGTTGAAGA TGAAGTGCGT TCATCTAAAC GTGTAACAAC 1850 

45 ATTAATGGGT GACAAAGTAC AACCTAGACG TGAATGGATT GAAAAGCATG 1900 

TTGAGTTTGG TATGCAAGAG GACCAAAGTA TTTTAGATAA TTCTGAAGTA 1950 

CAAGTGCTTG AAAATGATCA ATTTGATGAG GAGGAAATCT AG 1992 



50 

2) INFORMATION FOR SEQ ID NO: 1329 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1329 
10 TGTAGAGCGC GGTATCATCA AAGTA 25 



15 



2) INFORMATION FOR SEQ ID NO: 1330 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1330 

25 

AGATTCGAAC TTGGTGTGCG GG 22 



30 2) INFORMATION FOR SEQ ID NO: 1331 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1331 

GCCCTTGAGG TACAGAATGG TAATGAAGTT 30 



45 

2) INFORMATION FOR SEQ ID NO: 1332 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1332 
GACCGCGGCG CAGACCATCA 



2) INFORMATION FOR SEQ ID NO: 1333 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1333 

* 

TCATGGTGAC TTATCTATTT ATG 



2) INFORMATION FOR SEQ ID NO: 1334 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1334 
CATCTATTTA TAAAGCAATG GTA 



2) INFORMATION FOR SEQ ID NO: 1335 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1335 
CTATTTATGG AGCAATGGT 
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2) INFORMATION FOR SEQ ID NO: 1336 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1336 
TGGAGACTAC TCAGTGT 



2) INFORMATION FOR SEQ ID NO: 1337 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1337 
TGGAGACTTC TCAGTGT 



2) INFORMATION FOR SEQ ID NO: 1338 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1338 
GTGTACGGAG CAATG 



2) INFORMATION FOR SEQ ID NO: 1339 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1339 
CCAGCGGAAA TGCGT 15 



2) INFORMATION FOR SEQ ID NO: 1340 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1340 
GAACAAGGTA TGACACCGGA TAAAT 25 

25 

2) INFORMATION FOR SEQ ID NO: 1341 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1341 
40 GATAACTGAA ATCCTGAGCC ATACG 25 



2) INFORMATION FOR SEQ ID NO: 1342 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1342 

GATGTTATTG GTCAATATCA TCCA 24 



2) INFORMATION FOR SEQ ID NO: 1343 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 29 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1343 
AAGAAACTGT CTCTTTATTA ATATCACGT 29 

20 

2) INFORMATION FOR SEQ ID NO: 1344 

2 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1344 
35 AGCAGCAACG ATGTTACGCA GCAG 24 



2) INFORMATION FOR SEQ ID NO: 1345 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1345 

50 

CCCGCCGAGC ATTTCAACTA TTG 23 
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2) INFORMATION FOR SEQ ID NO: 1346 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1346 
GATGTTACGC AGCAGGGCAG TC 



2) INFORMATION FOR SEQ ID NO: 1347 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1347 
ACCAAGCAGG TTCGCAGTCA AGTA 



2) INFORMATION FOR SEQ ID NO: 1348 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Unidentified bacterium 
(C) ACCESSION NUMBER: X04555 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1348 

ATGCGCTCAC GCAACTGGTC CAGAACCTTG ACCGAACGCA GCGGTGGTAA 50 

CGGCGCAGTG GCGGTTTTCA TGGCTTGTTA TGACTGTTTT TTTGTACAGT 100 

CTATGCCTCG GGCATCCAAG CAGCAAGCGC GTTACGCCGT GGGTCGATGT 150 

TTGATGTTAT GGAGCAGCAA CGATGTTACG CAGCAGGGCA GTCGCCCTAA 200 

AACAAAGTTA GGCCGCATGG ACACAACGCA GGTCACATTG ATACACAAAA 250 
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TTCTAGCTGC GGCAGATGAG CGAAATCTGC 

TGGGCGATCG ATGCACGGCT AGGGCGTGTA 

TGATCTGACG TTTCCCGGCG AGAGGCGCGG 

AAATGCTCGG CGGGCGCGTC ATGGAGGAGT 

5 GAGATCGGGG ATGAGTTACT TGACTGCGAA 

AGCGTATGAA ATCGCGGAGG CTCCGCAGGG 

AGGGCGTCAT CGCCGGGCGG CCAGTCCGTT 

ATCTGGGATT ACTTTTACTA TGCCGATGAA 

TACAAAGCAC ATAGAGTCCT ACAGGCTCGC 

10 AAAAGGTTGA GGTCTTGCGT GCCGCTTTCA 



PCT/CAOO/01150 

CGCTCTGGAT CGGTGGGGGC 300 

ACACGCAAGC ACGATGATAT 350 

CGAGCTCGAG GCAATAGTTG 400 

TGGACTATGG ATTCTTAGCG 450 

CCTGCTTGGT GGGCAGACGA 500 

CTCGTGCCCA GAGGCGGCTG 550 

GTAACAGCTG GGAGGCGATC 600 

GTACCACCAG TGGACTGGCC 650 

ATGCACCTCA CTCGGGGCGG 700 

GGTCGCGATA TGCGGCCTAA 750 



2) INFORMATION FOR SEQ ID NO: 1349 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1349 

25 

CAGCCGACCA ATGAGTATCT TGCC 24 



30 2) INFORMATION FOR SEQ ID NO: 1350 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1350 

TAATCAGGGC AGTTGCGACT CCTA 24 



45 

2) INFORMATION FOR SEQ ID NO: 1351 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 531 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



725 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
5 (B) STRAIN: Stone 130 

(C) ACCESSION NUMBER: L06157 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1351 

10 ATGTTATGGA GCAGCAACGA TGTTACGCAG CAGGGCAGTC GCCCTAAAAC 50 

AAAGTTAGGT GGCTCAATGA GCATCATTGC AACCGTCAAG ATCGGCCCTG 100 

ACGAAATTTC AGCCATGAGG GCTGTGCTCG ATCTCTTCGG CAAAGAGTTT 150 

GAGGACATTC CAACCTACTC TGATCGCCAG CCGACCAATG AGTATCTTGC 200 

CAATCTTCTG CACAGCGAGA CGTTCATCGC GCTCGCTGCT TTTGACCGCG 250 

15 GAACAGCAAT AGGTGGGCTC GCCGCCTACG TTCTACCCAA GTTCGAGCAA 300 

GCGCGAAGCG AGATCTACAT TTATGACTTG GCAGTCGCTT CCAGCCATCG 350 

AAGGCTAGGA GTCGCAACTG CCCTGATTAG CCACCTGAAG CGTGTGGCGG 4 00 

TTGAACTTGG CGCGTATGTA ATCTATGTGC AAGCAGACTA CGGTGACGAT 4 50 

CCGGCAGTCG CTCTCTACAC AAAGCTTGGA GTTCGGGAAG ACGTCATGCA 500 

20 CTTCGACATT GATCCAAGAA CCGCCACCTA A 531 



25 



2) INFORMATION FOR SEQ ID NO: 1352 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1352 

35 

CCACGCTGAC AGAGCCGCAC CG 22 



40 2) INFORMATION FOR SEQ ID NO: 1353 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1353 

GGCCAGCTCC CATCGGACCC TG 22 
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2) INFORMATION FOR SEQ ID NO: 1354 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1354 
CACGCTGACA GAGCCGCACC G 2 



2) INFORMATION FOR SEQ ID NO: 1355 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1355 
ATGCCGTTGC TGTCGAAATC CTCG 2 



2) INFORMATION FOR SEQ ID NO: 1356 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia marcescens 
(C) ACCESSION NUMBER: M97172 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1356 

ATGAACACGA TCGAATCGAT CACGGCGGAC CTGCACGGAC TGGGCGTCCG 50 

GCCCGGCGAC CTGATCATGG TCCATGCATC GCTGAAAGCC GTCGGCCCGG 100 

TCGAGGGAGG TGCGGCCTCG GTGGTGTCGG CCCTTCGCGC CGCGGTCGGG 150 

TCCGCAGGGA CCCTGATGGG TTATGCCTCA TGGGACCGCT CGCCCTATGA 200 

GGAGACGCTG AACGGCGCGC GGATGGACGA AGAACTGCGC CGCCGGTGGC 250 
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CACCCTTCGA TCTGGCCACA TCCGGTACCT ATCCCGGCTT CGGCCTGCTC 300 

AACCGGTTTC TGCTTGAGGC GCCCGACGCA CGGCGCAGCG CGCATCCCGA 350 

CGCCTCCATG GTCGCGGTCG GCCCCCTTGC CGCCACGCTG ACAGAGCCGC 400 

ACCGGCTTGG GCAGGCGCTG GGCGAAGGCT CGCCGCTGGA GCGCTTCGTC 4 50 

5 GGGCATGGCG GAAAGGTCCT GCTTCTGGGA GCGCCGCTCG ACTCCGTCAC 500 

CGTGCTGCAT TACGCCGAGG CCATCGCCCC CATCCCGAAC AAACGCCGCG 550 

TGACCTATGA AATGCCGATG CTCGGCCCGG ATGGCAGGGT CCGATGGGAG 600 

CTGGCCGAGG ATTTCGACAG CAACGGCATT CTCGATTGCT TCGCGGTCGA 650 

TGGGAAGCCG GATGCCGTCG AGACGATCGC CAAGGCTTAT GTCGAACTGG 700 

10 GCCGGCATCG GGAAGGCATC GTCGGTCGCG CACCCTCCTA TCTGTTTGAA 750 

GCGCAGGATA TCGTCTCGTT CGGCGTCACC TATCTCGAAC AGCATTTCGG 800 

CGCGCCCTGA 810 



15 



25 



30 



2) INFORMATION FOR SEQ ID NO: 1357 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 20 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1357 
GCCCATCCAT TTGCCTTTGC 20 



2) INFORMATION FOR SEQ ID NO: 1358 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

4 0 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1358 
GCGTACCAAC TTGCCATCCT GAAG 24 

45 



2) INFORMATION FOR SEQ ID NO: 1359 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 
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10 



20 



25 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1359 
TGCCCCTGCC ACCTCACTC 19 



2) INFORMATION FOR SEQ ID NO: 1360 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1360 
CGTACCAACT TGCCATCCTG AAGA 24 



2) INFORMATION FOR SEQ ID NO: 1361 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 786 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: X01385 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1361 



GTGCAATACG AATGGCGAAA AGCCGAGCTC 
TGGGGTTACC CCCGGCGGTG TGCTGCTGGT 

45 TCCGGCCCCT CGAAGATGGG CCACTTGGAC 
GCGCTGGGTC CGGGAGGGAC GCTCGTCATG 
CGACGAGCCG TTCGATCCTG CCACGTCGCC 
TTGTCTCTGA CACATTCTGG CGCCTGCCAA 
CCATTTGCCT TTGCGGCAGC GGGGCCACAG 

50 TCCATTGCCC CTGCCACCTC ACTCGCCTGC 
ATGAACTCGA TGGGCAGGTA CTTCTCCTCG 
ACGACGCTGC ATCTTGCCGA GTTGATGGCA 
GAGACACTGC ACCATTCTTC AGGATGGCAA 



ATCGGTCAGC TTCTCAACCT 50 

CCACAGCTCC TTCCGTAGCG 100 

TGATCGAGGC CCTGCGTGCT 150 

CCCTCGTGGT CAGGTCTGGA 200 

CGTTACACCG GACCTTGGAG 250 

ATGTAAAGCG CAGCGCCCAT 300 

GCAGAGCAGA TCATCTCTGA 350 

AAGCCCGGTC GCCCGTGTCC 400 

GCGTGGGACA CGATGCCAAC 4 50 

AAGGTTCCCT ATGGGGTGCC 500 

GTTGGTACGC GTCGATTATC 550 
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TCGAGAATGA CCACTGCTGT 

AAGGAGAAGA GCCTTCAGAA 

GTTGATCCGC TCCCGCGACA 

GAGATCCGTT GATCTTCCTG 

5 CGATGCCGCT CGCCAGTCGA 



i 

GAGCGCTTTG CCTTGGCGGA 
GGAAGGTCCA GTCGGTCATG 
TTGTGGCGAC AGCCCTGGGT 
CATCCGCCAG AGGGCGGGAT 
TTGGCTGAGC TCATGA 
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CAGGTGGCTC 600 
CCTTTGCTCG 650 
CAACTGGGCC 700 
GCGAAGAATG 750 

786 



2) INFORMATION FOR SEQ ID NO: 1362 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1362 

20 

CGCCGCCATC GCCCAAAGCT GG .22 



25 2) INFORMATION FOR SEQ ID NO: 1363 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1363 

CGGCATAATG GAGCGCGGTG ACTG 24 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1364 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1364 

TTTCTCGCCC ACGCAGGAAA AATC 24 
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2) INFORMATION FOR SEQ ID NO: 1365 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1365 
CATCCTCGAC GAATATGCCG CG 



2) INFORMATION FOR SEQ ID NO: 1366 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterobacter cloacae 
(C) ACCESSION NUMBER: M88012 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1366 

ATGACTGATC CCCGCAAAAA CGGCGATTTG CACGAACCCG CGACGGCACC 50 

CGCGACGCCC TGGTCCAAAA GCGAGCTGGT CCGGCAATTG CGCGACCTCG 100 

GCGTGCGCTC AGGCGATATG GTGATGCCGC ATGTGTCGTT GCGCGCCGTC 150 

GGGCCGCTGG CGGACGGACC GCAGACACTT GTCGATGCGC TGATCGAGGC 200 

CGTCGGCCCC ACCGGGAATA TTCTCGCCTT CGTCTCGTGG CGCGATTCGC 250 

CCTATGAACA GACGCTGGGT CATGATGCGC CGCCCGCCGC CATCGCCCAA 300 

AGCTGGCCTG CGTTCGACCC CGACCATGCG CCCGCCTACC CCGGCTTTGG 350 

CGCGATCAAC GAATTTATCC GAACCTATCC GGGGTGTCGG CGCACGGCCC 400 

ATCCCGACGC ATCGATGGCG GCGATCGGGC CCGATGCGGC GTGGCTGGTG 4 50 

GCGCCGCACG AGATGGGCGC CGCTTATGGC CCCCGCTCGC CGATCGCGCG 500 

TTTTCTCGCC CACGCAGGAA AAATCCTGTC GATCGGCGCC GGGCCCGATG 550 

CAGTCACCGC GCTCCATTAT GCCGAAGCGG TGGCGCGGAT CGAGGGCAAG 600 

CGCCGCGTCA CTTATTCGAT GCCCTTACTG CGCGAAGGCA AGCGCGTCTG 650 

GGTCACCACG TCCGACTGGG ATTCGAACGG CATCCTCGAC GAATATGCCG 700 

CGCCCGACGG CCCCGACGCG GTCGAACGGA TCGCCCGCGA CTATCTCGCC 7 50 

CGCACCAGGG TTGCGCAAGG CCCGGTCGGC GGCGCGCAAT CCCGGCTGAT 800 

CGACGCGGCC GATATCGTTT CCTTCGGCAT CGAATGGCTC GAGGCGCGCC 850 

ACGCCGCGCC AGCGGCGGCA GCGCTGAAGC CGAAACAACG CCGCGACTGA 900 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1367 
CAAATATACT AACAGAAGCG TTCA 



2) INFORMATION FOR SEQ ID NO: 1368 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1368 
AGGATCTTGC CAATACCTTT AT 



2) INFORMATION FOR SEQ ID NO: 1369 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1369 
AAACCTTTGT TTCGGTCTGC TAAT 



2) INFORMATION FOR SEQ ID NO: 1370 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1370 
AAGCGATTCC AATAATACCT TGCT 



2) INFORMATION FOR SEQ ID NO: 1371 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 558 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter diversus 
(C) ACCESSION NUMBER: M18967 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1371 

ATGAATTATC AAATTGTGAA TATTGCGGAA TGCAGCAATT ATCAGTTAGA 50 

AGCAGCAAAT ATACTAACAG AAGCGTTCAA TGATCTTGGT AACAATTCAT 100 

GGCCAGATAT GACGAGTGCA ACAAAAGAAG TAAAAGAATG TATTGAGAGT 150 

CCAAACCTTT GTTTCGGTCT GCTAATAAAT AACTCCTTAG TTGGCTGGAT 200 

AGGCTTAAGG CCAATGTACA AGGAAACCTG GGAATTGCAT CCATTGGTTG 250 

TCAGACCAGA TTATCAAAAT AAAGGTATTG GCAAGATCCT GCTTAAGGAA 300 

TTAGAAAACA GAGCTAGAGA GCAAGGTATT ATTGGAATCG CTTTAGGAAC 350 

AGATGATGAA TACTATAGAA CAAGTCTCTC TTTAATAACT ATAACAGAAG 400 

ATAATATATT TGATTCAATA AAAAATATTA AAAATATTAA TAAACATCCA 450 

TATGAGTTTT ATCAGAAGAA TGGTTATTAT ATTGTTGGAA TAATTCCAAA 500 

TGCCAATGGT AAAAACAAAC CAGATATTTG GATGTGGAAA AGTTTAATCA 550 

AAGAGTAA 558 



2) INFORMATION FOR SEQ ID NO: 1372 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1372 
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GCTTTCGTTG CCTTTGCCGA GGTC 



2) INFORMATION FOR SEQ ID NO: 1373 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1373 
CACCCCTGTT GCTTCGCCCA CTC 



2) INFORMATION FOR SEQ ID NO: 1374 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1374 
AGATATTGGC TTCGCCGCAC CACA 



2) INFORMATION FOR SEQ ID NO: 1375 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1375 
CCCTGTTGCT TCGCCCACTC CTG 
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2) INFORMATION FOR SEQ ID NO: 1376 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Serratia marcescens 
(C) ACCESSION NUMBER: M94066 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1376 

ATGATCGTCA TCTGCGACCA CGACAACCTC GACGCCTGGC TGGCGCTGCG 50 

CACCGCGCTG TGGCCCTCCG GCTCGCCTGA AGATCACCGC GCGGAAATGC 100 

GCGAGATATT GGCTTCGCCG CACCACACCG CGTTTATGGC GCGGGGGCTG 150 

GACGGCGCTT TCGTTGCCTT TGCCGAGGTC GCGCTGCGCT ACGATTACGT 200 

CAACGGCTGC GAATCGTCGC CGGTGGCGTT TTTGGAAGGA ATTTATACCG 250 

CCGAACGCGC CCGCCGCCAG GGCTGGGCCG CGCGCCTGAT CGCGCAGGTG 300 

CAGGAGTGGG CGAAGCAACA GGGGTGCAGC GAGCTGGCGT CGGATACCGA 350 

TATCGCCAAT CTGGACTCCC AGCGCCTGCA TGCGGCGCTG GGCTTTGCCG 400 

AAACGGAGCG AGTAGTGTTT TACCGCAAAA CGCTGGGCTG A 441 



2) INFORMATION FOR SEQ ID NO: 1377 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1377 
GCCGTGGGTC GATGTTTGAT GTTA 



2) INFORMATION FOR SEQ ID NO: 1378 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1378 
GCTCGATGAC GCCAACTACC TCTG 



2) INFORMATION FOR SEQ ID NO: 1379 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1379 
AGCAGCAACG ATGTTACGCA GCAG 



2) INFORMATION FOR SEQ ID NO: 1380 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1380 
CGCTCGATGA CGCCAACTAC CTCT 



2) INFORMATION FOR SEQ ID NO: 1381 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 972 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: X02340 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1381 



GTGGTAACGG CGCAGTGGCG GTTTTCATGG 
TTTTGGGGTA CAGTCTATGC CTCGGGCATC 
CCGTGGGTCG ATGTTTGATG TTATGGAGCA 
GGCAGTCGCC CTAAAACAAA GTTAAACATC 
CGAAGTATCG ACTCAACTAT CAGAGGTAGT 
TCGAACCGAC GTTGCTGGCC GTACATTTGT 
GGCCTGAAGC CACACAGTGA TATTGATTTG 
GCTTGATGAA ACAACGCGGC GAGCTTTGAT 
CGGCTTCCCC TGGAGAGAGC GAGATTCTCC 
GTTGTGCACG ACGACATCAT TCCGTGGCGT 
GCAATTTGGA GAATGGCAGC GCAATGACAT 
CAGCCACGAT CGACATTGAT CTGGCTATCT 
CATAGCGTTG CCTTGGTAGG TCCAGCGGCG 
TCCTGAACAG GATCTATTTG AGGCGCTAAA 
ACTCGCCGCC CGACTGGGCT GGCGATGAGC 
TCCCGCATTT GGTACAGCGC AGTAACCGGC 
CGCTGCCGAC TGGGCAATGG AGCGCCTGCC 
TACTTGAAGC TAGACAGGCT TATCTTGGAC 
TCGCGCGCAG ATCAGTTGGA AGAATTTGTC 
CACCAAGGTA GTCGGCAAAT AA 



CTTCTTGTTA TGACATGTTT 50 

CAAGCAGCAA GCGCGTTACG 100 

GCAACGATGT TACGCAGCAG 150 

ATGAGGGAAG CGGTGATCGC 200 

TGGCGTCATC GAGCGCCATC 250 

ACGGCTCCGC AGTGGATGGC 300 

CTGGTTACGG TGACCGTAAG 350 

CAACGACCTT TTGGAAACTT 4 00 

GCGCTGTAGA AGTCACCATT 450 

TATCCAGCTA AGCGCGAACT 500 

TCTTGCAGGT ATCTTCGAGC 550 

TGCTGACAAA AGCAAGAGAA 600 

GAGGAACTCT TTGATCCGGT 650 

TGAAACCTTA ACGCTATGGA 700 

GAAATGTAGT GCTTACGTTG 750 

AAAATCGCGC CGAAGGATGT 800 

GGCCCAGTAT CAGCCCGTCA 8 50 

AAGAAGAAGA TCGCTTGGCC 900 

CACTACGTGA AAGGCGAGAT 950 

972 



2) INFORMATION FOR SEQ ID NO: 1382 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1382 
TAGATATGAT AGGCGGTAAA AAGC 



2) INFORMATION FOR SEQ ID NO: 1383 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1383 
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2) INFORMATION FOR SEQ ID NO: 1384 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1384 
GATATGATAG GCGGTAAAAA GC 



2) INFORMATION FOR SEQ ID NO: 1385 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1385 
TCCCAAATTC GAGTAAGAGG TA 



2) INFORMATION FOR SEQ ID NO: 1386 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: V01282 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1386 

ATGAAAGAAA GATATGGAAC AGTATATAAA GGCTCTCAGA GGCTCATAGA 50 
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CGAGGAAAGT GGAGAAGTAA TAGAGGTAGA TAAGCTATAC CGTAAACAAA 100 

CGTCTGGTAA CTTTGTAAAA GCGTATATCG TCCAATTAAT AAGTATGTTA 150 

GATATGATAG GCGGTAAAAA GCTCAAGATT GTTAATTATA TAT TAG AT AA 200 

TGTACATCTA AGTAATAACA CAATGATAGC AACTGTTAGA GAAATAGCAG 250 

AAGGAACAAA TACAAGCACG AAAACCGTAA ATACAACGCT TAAAATCTTA 300 

GAAGAAGGAA ATATCATTAA AAGAAGAACT GGAGCATTAA TGCTAAACCC 350 

AGAGCTACTC ATGAGAGGCG ATGACCAAAA ACAAAAATAC CTCTTACTCG 400 

AATTTGGGAA CTTTGAGCAA GAGGACGACC AAAAGCAAGA AAATGCTTTA 4 50 

TCAGAATATT ATTCTTTCAA GGAGTAG 477 



2) INFORMATION FOR SEQ ID NO: 1387 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1387 
25 TTATGCCTCT TCCGACCATC AAGC 24 



2) INFORMATION FOR SEQ ID NO: 1388 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
35 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1388 

40 

TACGCTCGTC ATCAAAATCA CTCG 24 



45 2) INFORMATION FOR SEQ ID NO: 1389 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1389 
GAATAACGGT TTGGTTGATG CGAG 



2) INFORMATION FOR SEQ ID NO: 1390 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1390 
ATGGCAAGAT CCTGGTATCG GTCT 



2) INFORMATION FOR SEQ ID NO: 1391 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: J01839 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1391 

ATGAGCCATA TTCAACGGGA AACGTCTTGC TCGAGGCCGC GATTAAATTC 50 

CAACATGGAT GCTGATTTAT ATGGGTATAA ATGGGCTCGC GATAATGTCG 100 

GGCAATCAGG TGCGACAATC TATCGATTGT ATGGGAAGCC CGATGCGCCA 150 

GAGTTGTTTC TGAAACATGG CAAAGGTAGC GTTGCCAATG ATGTTACAGA 200 

TGAGATGGTC AGACTAAACT GGCTGACGGA ATTTATGCCT CTTCCGACCA 250 

TCAAGCATTT TATCCGTACT CCTGATGATG CATGGTTACT CACCACTGCG 300 

ATCCCCGGGA AAACAGCATT CCAGGTATTA GAAGAATATC CTGATTCAGG 350 

TGAAAATATT GTTGATGCGC TGGCAGTGTT CCTGCGCCGG TTGCATTCGA 400 

TTCCTGTTTG TAATTGTCCT TTTAACAGCG ATCGCGTATT TCGTCTCGCT 450 

CAGGCGCAAT CACGAATGAA TAACGGTTTG GTTGATGCGA GTGATTTTGA 500 

TGACGAGCGT AATGGCTGGC CTGTTGAACA AGTCTGGAAA GAAATGCATA 550 

AGCTTTTGCC ATTCTCACCG GATTCAGTCG TCACTCATGG TGATTTCTCA 600 

CTTGATAACC TTATTTTTGA CGAGGGGAAA TTAATAGGTT GTATTGATGT 650 

TGGACGAGTC GGAATCGCAG ACCGATACCA GGATCTTGCC ATCCTATGGA 700 
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ACTGCCTCGG TGAGTTTTCT CCTTCATTAC AGAAACGGCT TTTTCAAAAA 750 
TATGGTATTG ATAATCCTGA TATGAATAAA TTGCAGTTTC ATTTGATGCT 800 
CGATGAGTTT TTCTAA 816 



2) INFORMATION FOR SEQ ID NO: 1392 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1392 
TGGGTGGAGA GGCTATTCGG CTAT 24 

20 

2) INFORMATION FOR SEQ ID NO: 1393 

25 (i) SEQUENCE CHARACTERISTICS: 

-(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1393 
35 CAGTCCCTTC CCGCTTCAGT GAC 23 



2) INFORMATION FOR SEQ ID NO: 1394 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1394 

50 

GACGTTGTCA CTGAAGCGGG AAGG 24 
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2) INFORMATION FOR SEQ ID NO: 1395 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1395 
CTTGGTGGTC GAATGGGCAG GTAG 



2) INFORMATION FOR SEQ ID NO: 1396 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: V00618 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1396 

ATGATTGAAC AAGATGGATT GCACGCAGGT TCTCCGGCCG CTTGGGTGGA 50 

GAGGCTATTC GGCTATGACT GGGCACAACA GACAATCGGC TGCTCTGATG 100 

CCGCCGTGTT CCGGCTGTCA GCGCAGGGGC GCCCGGTTCT TTTTGTCAAG 150 

ACCGACCTGT CCGGTGCCCT GAATGAACTG CAGGACGAGG CAGCGCGGCT 200 

ATCGTGGCTG GCCACGACGG GCGTTCCTTG CGCAGCTGTG CTCGACGTTG 250 

TCACTGAAGC GGGAAGGGAC TGGCTGCTAT TGGGCGAAGT GCCGGGGCAG 300 

GATCTCCTGT CATCTCACCT TGCTCCTGCC GAGAAAGTAT CCATCATGGC 350 

TGATGCAATG CGGCGGCTGC ATACGCTTGA TCCGGCTACC TGCCCATTCG 400 

ACCACCAAGC GAAACATCGC ATCGAGCGAG CACGTACTCG GATGGAAGCC 450 

GGTCTTGTCG ATCAGGATGA TCTGGACGAA GAGCATCAGG GGCTCGCGCC 500 

AGCCGAACTG TTCGCCAGGC TCAAGGCGCG CATGCCCGAC GGCGAGGATC 550 

TCGTCGTGAC CCATGGCGAT GCCTGCTTGC CGAATATCAT GGTGGAAAAT 600 

GGCCGCTTTT CTGGATTCAT CGACTGTGGC CGGCTGGGTG TGGCGGACCG 650 

CTATCAGGAC ATAGCGTTGG CTACCCGTGA TATTGCTGAA GAGCTTGGCG 700 

GCGAATGGGC TGACCGCTTC CTCGTGCTTT ACGGTATCGC CGCTCCCGAT 750 

TCGCAGCGCA TCGCCTTCTA TCGCCTTCTT GACGAGTTCT TCTGA 7 95 



2) INFORMATION FOR SEQ ID NO: 1397 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1397 
GTGGGAGAAA ATGAAAACCT AT 



2) INFORMATION FOR SEQ ID NO: 1398 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1398 
ATGGAGTGAA AGAGCCTGAT 



2) INFORMATION FOR SEQ ID NO: 1399 

(i) SEQUENCE CHARACTERISTICS:' 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1399 
ACCTATGATG TGGAACGGGA AAAG 



2) INFORMATION FOR SEQ ID NO: 1400 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1400 
CGATGGAGTG AAAGAGCCTG ATG 23 



10 2) INFORMATION FOR SEQ ID NO: 1401 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 95 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(C) ACCESSION NUMBER: V01547 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1401 



ATGGCTAAAA TGAGAATATC ACCGGAATTG AAAAAACTGA TCGAAAAATA 50 

CCGCTGCGTA AAAGATACGG AAGGAATGTC TCCTGCTAAG GTATATAAGC 100 

TGGTGGGAGA AAATGAAAAC CTATATTTAA AAATGACGGA CAGCCGGTAT 150 

AAAGGGACCA CCTATGATGT GGAACGGGAA AAGGACATGA TGCTATGGCT 200 

3 0 GGAAGGAAAG CTGCCTGTTC CAAAGGTCCT GCACTTTGAA CGGCATGATG 250 

GCTGGAGCAA TCTGCTCATG AGTGAGGCCG ATGGCGTCCT TTGCTCGGAA 300 

GAGTATGAAG ATGAACAAAG CCCTGAAAAG ATTATCGAGC TGTATGCGGA 350 

GTGCATCAGG CTCTTTCACT CCATCGACAT ATCGGATTGT CCCTATACGA 4 00 

ATAGCTTAGA CAGCCGCTTA GCCGAATTGG ATTACTTACT GAATAACGAT 4 50 

3 5 CTGGCCGATG TGGATTGCGA AAACTGGGAA GAAGACACTC CATTTAAAGA 500 

TCCGCGCGAG CTGTATGATT TTTTAAAGAC GGAAAAGCCC GAAGAGGAAC 550 

TTGTCTTTTC CCACGGCGAC CTGGGAGACA GCAACATCTT TGTGAAAGAT 600 

GGCAAAGTAA GTGGCTTTAT TGATCTTGGG AGAAGCGGCA GGGCGGACAA 650 

GTGGTATGAC ATTGCCTTCT GCGTCCGGTC GATCAGGGAG GATATCGGGG 700 

40 AAGAACAGTA TGTCGAGCTA TTTTTTGACT TACTGGGGAT CAAGCCTGAT 750 

TGGGAGAAAA TAAAATATTA TATTTTACTG GATGAATTGT TTTAG 7 95 



45 2) INFORMATION FOR SEQ ID NO: 1402 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1402 
TATTCAACAA TTTATCGGAA ACAG 24 

5 



2) INFORMATION FOR SEQ ID NO: 1403 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1403 
20 TCAGAGAGCC AACTCAACAT TTT 23 



10 



15 



2) INFORMATION FOR SEQ ID NO: 1404 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1404 

35 

AAACAGCGTT TTAGAGCCAA ATAA 24 



40 2) INFORMATION FOR SEQ ID NO: 1405 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1405 

TTCTCAGAGA GCCAACTCAA CATT 24 
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2) INFORMATION FOR SEQ ID NO: 1406 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter baumannii 

(B) STRAIN: BM2580 

(C) ACCESSION NUMBER: X07753 

(xi) SEQUENCE DESCRIPTION: SEQ , ID NO: 1406 



ATGGAATTGC CCAATATTAT TCAACAATTT ATCGGAAACA GCGTTTTAGA 50 

GCCAAATAAA ATTGGTCAGT CGCCATCGGA TGTTTATTCT TTTAATCGAA 100 

20 ATAATGAAAC TTTTTTTCTT AAGCGATCTA GCACTTTATA TACAGAGACC 150 

ACATACAGTG TCTCTCGTGA AGCGAAAATG TTGAGTTGGC TCTCTGAGAA 200 

ATTAAAGGTG CCTGAACTCA TCATGACTTT TCAGGATGAG CAGTTTGAAT 250 

TCATGATCAC TAAAGCGATC AATGCAAAAC CAATTTCAGC GCTTTTTTTA 300 

ACAGACCAAG AATTGCTTGC TATCTATAAG GAGGCACTCA ATCTGTTAAA 350 

25 TTCAATTGCT ATTATTGATT GTCCATTTAT TTCAAACATT GATCATCGGT 400 

TAAAAGAGTC AAAATTTTTT ATTGATAACC AACTCCTTGA CGATATAGAT 450 

CAAGATGATT TTGACACTGA ATTATGGGGA GACCATAAAA CTTACCTAAG 500 

TCTATGGAAT GAGTTAACCG AGACTCGTGT TGAAGAAAGA TTGGTTTTTT 550 

CTCATGGCGA TATCACGGAT AGTAATATTT TTATAGATAA ATTCAATGAA 600 

30 ATTTATTTTT TAGATCTTGG TCGTGCTGGG TTAGCAGATG AATTTGTAGA 650 

TATATCCTTT GTTGAACGTT GCCTAAGAGA GGATGCATCG GAGGAAACTG 700 

CGAAAATATT TTTAAAGCAT TTAAAAAATG ATAGACCTGA CAAAAGGAAT 7 50 

TATTTTTTAA AACTTGATGA ATTGAATTGA 7 80 



2) INFORMATION FOR SEQ ID NO: 1407 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1407 
CCCTGTAATA GAAAAGCAAG TAGG 24 

50 



2) INFORMATION FOR SEQ ID NO: 1408 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1408 
TTGTCGTATC CCTCAAATCA CC 



2) INFORMATION FOR SEQ ID NO: 1409 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1409 
TGGGATTACA ATGGCAATCA GCG 



2) INFORMATION FOR SEQ ID NO: 1410 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1410 
GGGGAATAGG TCACAAGATC TGCTT 



2) INFORMATION FOR SEQ ID NO: 1411 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 912 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
(C) ACCESSION NUMBER: S46063 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1411 

ATGCTTTTAT ATAAAATGTG TGACAATCAA AATTATGGGG TTACTTACAT 50 

GAAGTTTTTA TTGGCATTTT CGCTTTTAAT ACCATCCGTG GTTTTTGCAA 100 

GTAGTTCAAA GTTTCAGCAA GTTGAACAAG ACGTTAAGGC AATTGAAGTT 150 

TCTCTTTCTG CTCGTATAGG TGTTTCCGTT CTTGATACTC AAAATGGAGA 200 

ATATTGGGAT TACAATGGCA ATCAGCGCTT CCCGTTAACA AGTACTTTTA 250 

AAACAATAGC TTGCGCTAAA TTACTATATG ATGCTGAGCA AGGAAAAGTT 300 

AATCCCAATA GTACAGTCGA GATTAAGAAA GCAGATCTTG TGACCTATTC 350 

CCCTGTAATA GAAAAGCAAG TAGGGCAGGC AATCACACTC GATGATGCGT 400 

GCTTCGCAAC TATGACTACA AGTGATAATA CTGCGGCAAA TATCATCCTA 450 

AGTGCTGTAG GTGGCCCCAA AGGCGTTACT GATTTTTTAA GACAAATTGG 500 

GGACAAAGAG ACTCGTCTAG ACCGTATTGA GCCTGATTTA AATGAAGGTA 550 

AGCTCGGTGA TTTGAGGGAT ACGACAACTC CTAAGGCAAT AGCCAGTACT 600 

TTGAATAAAC TTTTATTTGG TTCCGCGCTA TCTGAAATGA ACCAGAAAAA 650 

ATTAGAGTCT TGGATGGTGA ACAATCAAGT CACTGGTAAT TTACTACGTT 700 

CAGTATTGCC GGCGGGATGG AACATTGCGG ATCGCTCAGG TGCTGGCGGA 750 

TTTGGTGCTC GGAGTATTAC AGCAGTTGTG TGGAGTGAGC ATCAAGCCCC 800 

AATTATTGTG AGCATCTATC TAGCTCAAAC ACAGGCTTCA ATGGCAGAGC 850 

GAAATGATGC GATTGTTAAA ATTGGTCATT CAATTTTTGA CGTTTATACA 900 

TCACAGTCGC GC 912 



2) INFORMATION FOR SEQ ID NO: 1412 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1412 
GAGAAAACGC TCCAGCAGGG C 



2) INFORMATION FOR SEQ ID NO: 1413 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1413 
CATGAGGCTT TCACTGCGGG G 



2) INFORMATION FOR SEQ ID NO: 1414 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1414 
TATCGTTAAT CGCACCATCA C 



2) INFORMATION FOR SEQ ID NO: 1415 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1415 
ATGCAGTAAT GCGGCTTTAT C 



2) INFORMATION FOR SEQ ID NO: 1416 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1146 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: HEL-1 

(C) ACCESSION NUMBER: X91840 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1416 



ATGATGAAAA AATCGTTATG CTGCGCTCTG CTGCTGACAG CCTCTTTCTC 50 

CACATTTGCT GCCGCAAAAA CAGAACAACA GATTGCCGAT ATCGTTAATC 100 

10 GCACCATCAC CCCGTTGATG CAGGAGCAGG CTATTCCGGG TATGGCCGTT 150 

GCCGTTATCT ACCAGGGAAA ACCCTATTAT TTCACCTGGG GTAAAGCCGA 200 

TATCGCCAAT AACCACCCAG TCACGCAGCA AACGCTGTTT GAGCTAGGAT 250 

CGGTTAGTAA GACGTTTAAC GGCGTGTTGG GCGGCGATGC TATCGCCCGC 300 

GGCGAAATTA AGCTCAGCGA TCCGGTCACG AAATACTGGC CAGAACTGAC 350 

15 AGGCAAACAG TGGCAGGGTA TCCGCCTGCT GCACTTAGCC ACCTATACGG 400 

CAGGCGGCCT ACCGCTGCAG ATCCCCGATG ACGTTAGGGA TAAAGCCGCA 4 50 

TTACTGCATT TTTATCAAAA CTGGCAGCCG CAATGGACTC CGGGCGCTAA 500 

GCGACTTTAC GCTAACTCCA GCATTGGTCT GTTTGGCGCG CTGGCGGTGA 550 

AACCCTCAGG AATGAGTTAC GAAGAGGCAA TGACCAGACG CGTCCTGCAA 600 

2 0 CCATTAAAAC TGGCGCATAC CTGGATTACG GTTCCGCAGA ACGAACAAAA 650 

AGATTATGCC TGGGGCTATC GCGAAGGGAA GCCCGTACAC GTTTCTCCGG ' 700 

GACAACTTGA CGCCGAAGCC TATGGCGTGA AATCCAGCGT TATTGATATG 750 

GCCCGCTGGG TTCAGGCCAA CATGGATGCC AGCCACGTTC AGGAGAAAAC 800 

GCTCCAGCAG GGCATTGCGC TTGCGCAGTC TCGCTACTGG CGTATTGGCG 850 

25 ATATGTACCA GGGATTAGGC TGGGAGATGC TGAACTGGCC GCTGAAAGCT 900 

GATTCGATCA TCAACGGCAG CGACAGCAAA GTGGCATTGG CAGCGCTTCC 950 

CGCCGTTGAG GTAAACCCGC CCGCCCCCGC AGTGAAAGCC TCATGGGTGC 1000 

ATAAAACGGG CTCCACTGGT GGATTTGGCA GCTACGTAGC CTTCGTTCCA 1050 

GAAAAAAACC TTGGCATCGT GATGCTGGCA AACAAAAGCT ATCCTAACCC 1100 

3 0 TGTCCGTGTC GAGGCGGCCT GGCGCATTCT TGAAAAGCTG CAATAA 114 6 



2) INFORMATION FOR SEQ ID NO: 1417 

35 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1417 

45 

TGGTTAACTA YAATCCSATT GCGGA 25 



50 2) INFORMATION FOR SEQ ID NO: 1418 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1418 
ATGCTTTACC CAGCGTCAGA TT 



2) INFORMATION FOR SEQ ID NO: 1419 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1419 
CGATGAATAA GCTGATTTCT CACG 



2) INFORMATION FOR SEQ ID NO: 1420 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1420 
TGCTTTACCC AGCGTCAGAT TACG 



2) INFORMATION FOR SEQ ID NO: 1421 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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25 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1421 
AATTAGAGCG GCAGTCGGGA GGAA 24 



2) INFORMATION FOR SEQ ID NO: 1422 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1422 
2 0 GAAATCAGCT TATTCATCGC CACG 24 



2) INFORMATION FOR SEQ ID NO: 1423 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 6 bases 

(B) TYPE: Nuoleic acid 

(C) STRANDEDNESS: Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Escherichia coli 

(B) STRAIN: GRI-1 

(C) ACCESSION NUMBER: X92506 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1423 



ATGGTTAAAA AATCACTGCG TCAGTTCACG CTGATGGCGA CGGCAACCGT 50 

CACGCTGTTG TTAGGAAGTG TGCCGCTGTA TGCGCAAACG GCGGACGTAC 100 

AGCAAAAACT TGCCGAATTA GAGCGGCAGT CGGGAGGAAG ACTGGGTGTG 150 

GCATTGATTA ACACAGCAGA TAATTCGCAA ATACTTTATC GTGCTGATGA 200 

45 GCGCTTTGCG ATGTGCAGCA CCAGTAAAGT GATGGCCGTG GCCGCGGTGC 250 

TGAAGAAAAG TGAAAGCGAA CCGAATCTGT TAAATCAGCG AGTTGAGATC 300 

AAAAAATCTG ACTTGGTTAA CTATAATCCG ATTGCGGAAA AGCACGTCGA 350 

TGGGACGATG TCACTGGCTG AGCTTAGCGC GGCCGCGCTA CAGTACAGCG 400 

ATAACGTGGC GATGAATAAG CTGATTTCTC ACGTTGGCGG CCCGGCTAGC 450 

50 GTCACCGCGT TCGCCCGACA GCTGGGAGAC GAAACGTTCC GTCTCGACCG 500 

TACCGAGCCG ACGTTAAACA CCGCCATTCC GGGCGATCCG CGTGATAGCA 550 

CTTCACCTCG GGCAATGGCG CAAACTCTGC GTAATCTGAC GCTGGGTAAA 600 

GCATTGGGTG ACAGCCAACG GGCGCAGCTG GTGACATGGA TGAAAGGCAA 650 
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TACCACCGGT GCAGCGAGCA TTCAGGCTGG 
TGGGGGATAA AACCGGCAGC GGTGACTATG 
GTGATCTGGC CAAAAGATCG TGCGCCGCTG 
CCAGCCTCAA CCTAAGGCAG AAAGCCGTCG 
CTAAAATCGT CACCAACGGT TTGTAA 



ACTGCCTGCT TCCTGGGTTG 700 
GCACCACCAA CGATATCGCG 750 
ATTCTGGTCA CTTACTTCAC 800 
CGATGTATTA GCGTCGGCGG 850 

876 



2) INFORMATION FOR SEQ ID NO: 1424 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1424 
GTTAACGGTG ATGGCGACGC TAC 



2) INFORMATION FOR SEQ ID NO: 1425 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1425 
GAATTATCGG CGGTGTTAAT CAGC 



2) INFORMATION FOR SEQ ID NO: 1426 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1426 
CACGCTCAAT ACCGCCATTC CA 
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2) INFORMATION FOR SEQ ID NO: 1427 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1427 
TTATCGCCCA CTACCCATGA TTTC 24 



2) INFORMATION FOR SEQ ID NO: 1428 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 876 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella typhimurium 

(B) STRAIN: CAS-5 

30 (C) ACCESSION NUMBER: X92507 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1428 

ATGATGACTC AGAGCATTCG CCGCTCAATG TTAACGGTGA TGGCGACGCT 50 

35 ACCCCTGCTA TTTAGCAGCG CAACGCTGCA TGCGCAGGCG ' AACAGCGTGC 100 

AACAGCAGCT GGAAGCCCTG GAGAAAAGTT CGGGAGGTCG GCTTGGCGTT 150 

GCGCTGATTA ACACCGCCGA TAATTCGCAG ATTCTCTACC GTGCCGATGA 200 

ACGTTTTGCG ATGTGCAGTA CCAGTAAGGT GATGGCGGCC GCGGCGGTGC 250 

TTAAACAGAG CGAGAGCGAT AAGCACCTGC TAAATCAGCG CGTTGAAATC 300 

4 0 AAGAAGAGCG ACCTGGTTAA CTACAATCCC ATTGCGGAGA AACACGTTAA 350 

CGGCACGATG ACGCTGGCTG AGCTTGGCGC AGCGGCGCTG CAGTATAGCG 4 00 

ACAATACTGC CATGAATAAG CTGATTGCCC ATCTGGGTGG TCCCGATAAA 4 50 

GTGACGGCGT TTGCTCGCTC GTTGGGTGAT GAGACCTTCC GTCTGGACAG 500 

AACCGAGCCC ACGCTCAATA CCGCCATTCC AGGCGACCCG CGTGATACCA 550 

45 CCACGCCGCT CGCGATGGCG CAGACCCTGA AAAATCTGAC GCTGGGTAAA 600 

GCGCTGGCGG AAACTCAGCG GGCACAGTTG GTGACGTGGC TTAAGGGCAA 650 

TACTACCGGT AGCGCGAGCA TTCGGGCGGG TCTGCCGAAA TCATGGGTAG 700 

TGGGCGATAA AACCGGCAGC GGAGATTATG GCACCACCAA CGATATCGCG 750 

GTTATCTGGC CGGAAAACCA CGCACCGCTG GTTCTGGTGA CCTACTTTAC 800 

50 CCAACCGGAG CAGAAGGCGG AAAGCCGTCG GGATATTCTG GCTGCGGCGG 850 

CGAAAATCGT AACCCACGGT TTCTGA 87 6 
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2) INFORMATION FOR SEQ ID NO: 1429 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1429 
TTTACGGCTA AAGATACTGA AAAGT 



2) INFORMATION FOR SEQ ID NO: 1430 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ YD NO: 1430 
GTTTAATAAA ACAACCACCG AATAAT 



2) INFORMATION FOR SEQ ID NO: 1431 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1431 
TAATTGACAC TCCATTTACG GCTAA 



2) INFORMATION FOR SEQ ID NO: 1432 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1432 
ACCGAATAAT ATTTTCCTTT CAGGCA 26 



2) INFORMATION FOR SEQ ID NO: 1433 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 741 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

2 0 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
(C) ACCESSION NUMBER: AJ223604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1433 



ATGAGCAAGT TATCTGTATT CTTTATATTT TTGTTTTGCA GCATTGCTAC 50 

CGCAGCAGAG TCTTTGCCAG ATTTAAAAAT TGAAAAGCTT GATGAAGGCG 100 

3 0 TTTATGTTCA TACTTCGTTT GAAGAAGTTA ACGGGTGGGG CGTTGTTCCT 150 

AAACATGGTT TGGTGGTTCT TGTAAATGCT GAGGCTTACC TAATTGACAC 200 

TCCATTTACG GCTAAAGATA CTGAAAAGTT AGTCACTTGG TTTGTGGAGC 250 

GTGGCTATAA AATAAAAGGC AGCATTTCCT CTCATTTTCA TAGCGACAGC 300 

ACGGGCGGAA TAGAGTGGCT TAATTCTCGA TCTATCCCCA CGTATGCATC 350 

35 TGAATTAACA AATGAACTGC TTAAAAAAGA CGGTAAGGTT CAAGCCACAA 400 

ATTCATTTAG CGGAGTTAAC TATTGGCTAG TTAAAAATAA AATTGAAGTT 450 

TTTTATCCAG GCCCGGGACA CACTCCAGAT AACGTAGTGG TTTGGTTGCC 500 

TGAAAGGAAA ATATTATTCG GTGGTTGTTT TATTAAACCG TACGGTTTAG 550 

GCAATTTGGG TGACGCAAAT ATAGAAGCTT GGCCAAAGTC CGCCAAATTA 600 

40 TTAAAGTCCA AATATGGTAA GGCAAAACTG GTTGTTCCAA GTCACAGTGA 650 

AGTTGGAGAC GCATCACTCT TGAAACTTAC ATTAGAGCAG GCGGTTAAAG 700 

GGTTAAACGA AAGTAAAAAA CCATCAAAAC CAAGCAACTA A 741 



45 

2) INFORMATION FOR SEQ ID NO: 1434 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1434 
5 CACAATCAAG ACCAAGATTT GCGAT 25 



2) INFORMATION FOR SEQ ID NO: 1435 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1435 

20 

GAAAGGGCAG CTCGTTACGA TAGAG 25 



25 2) INFORMATION FOR SEQ ID NO: 1436 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1436 

CAGCATCAAC ATTTAAGATC CCCA 24 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1437 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1437 

CTCCACTTGA TTAACTGCGG AAATTC 26 
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2) INFORMATION FOR SEQ ID NO: 1438 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: X06046 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1438 

ATGGCAATCC GAATCTTCGC GATACTTTTC TCCATTTTTT CTCTTGCCAC 50 

TTTCGCGCAT GCGCAAGAAG GCACGCTAGA ACGTTCTGAC TGGAGGAAGT 100 

TTTTCAGCGA ATTTCAAGCC AAAGGCACGA TAGTTGTGGC AGACGAACGC 150 

20 CAAGCGGATC GTGCCATGTT GGTTTTTGAT CCTGTGCGAT CGAAGAAACG 200 

CTACTCGCCT GCATCGACAT TCAAGATACC TCATACACTT TTTGCACTTG 250 

ATGCAGGCGC TGTTCGTGAT GAGTTCCAGA TTTTTCGATG GGACGGCGTT 300 

AACAGGGGCT TTGCAGGCCA CAATCAAGAC CAAGATTTGC GATCAGCAAT 350 

GCGGAATTCT ACTGTTTGGG TGTATGAGCT ATTTGCAAAG GAAATTGGTG 400 

2 5 ATGACAAAGC TCGGCGCTAT TTGAAGAAAA TCGACTATGG CAACGCCGAT 450 

CCTTCGACAA GTAATGGCGA TTACTGGATA GAAGGCAGCC TTGCAATCTC 500 

GGCGCAGGAG CAAATTGCAT TTCTCAGGAA GCTCTATCGT AACGAGCTGC 550 

CCTTTCGGGT AGAACATCAG CGCTTGGTCA AGGATCTCAT GATTGTGGAA 600 

GCCGGTCGCA ACTGGATACT GCGTGCAAAG ACGGGCTGGG AAGGCCGTAT 650 

3 0 GGGTTGGTGG GTAGGATGGG TTGAGTGGCC GACTGGCTCC GTATTCTTCG 700 

CACTGAATAT TGATACGCCA AACAGAATGG ATGATCTTTT CAAGAGGGAG 7 50 

GCAATCGTGC GGGCAATCCT TCGCTCTATT GAAGCGTTAC CGCCCAACCC 800 
GGCAGTCAAC TCGGACGCTG CGCGATAA 828 



35 



50 



2) INFORMATION FOR SEQ ID NO: 1439 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 801 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
(C) ACCESSION NUMBER: J03427 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1439 

ATGAAAACAT TTGCCGCATA TGTAATTATC GCGTGTCTTT CGAGTACGGC 50 

758 
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ATTAGCTGGT TCAATTACAG AAAATACGTC 
CCGAAGCCGT CAATGGTGTC TTCGTGCTTT 
TGCGCTACCA ATGACTTAGC TCGTGCATCA 
AACATTTAAG ATCCCCAACG CAATTATCGG 
5 AGAATGAGCA TCAGGTTTTC AAATGGGACG 
CAATGGGAAA GAGACTTGAC CTTAAGAGGG 
TCCCGTATTT CAACAAATCG CCAGAGAAGT 
AATACCTTAA AAAATTTTCC TATGGCAACC 
GACAAATTCT GGTTGGAAGG CCAGCTTAGA 

10 GGAGTTTCTA GAGTCTCTAT ATTTAAATAA 
ACCAGCTAAT AGTAAAAGAG GCTTTGGTAA 
CTAGTGCATT CAAAAACTGG TTTTTCTGGT 
TGGTGTCGCA TGGTGGGTTG GGTGGGTTGA 
TTTTCGCCTT TAACATGGAT ATAGACAACG 

15 AAATCCATTC CCACCAAAAT CATGGAAAGT 
A 



TTGGAACAAA GAGTTCTCTG 100 

GTAAAAGTAG CAGTAAATCC 150 

AAGGAATATC TTCCAGCATC 200 

CCTAGAAACT GGTGTCATAA 250 

GAAAGCCAAG AGCCATGAAG 300 

GCAATACAAG TTTCAGCTGT 350 

TGGCGAAGTA AGAATGCAGA 4 00 

AGAATATCAG TGGTGGCATT 450 

ATTTCCGCAG TTAATCAAGT 500 

ATTGTCAGCA TCTAAAGAAA 550 
CGGAGGCGGC ACCTGAATAT ■ 600 

GTGGGAACTG AGTCAAATCC 650 

GAAGGAGACA GAGGTTTACT 700 

AAAGTAAGTT GCCGCTAAGA 750 

GAGGGCATCA TTGGTGGCTA 800 

801 



20 2) INFORMATION FOR SEQ ID NO: 1440 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1440 

AGACCGTTAT CGTAAACAGG GCTAAG 26 



35 



45 
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2) INFORMATION FOR SEQ ID NO: 1441 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1441 
TTTTTTGCTC AAACTTTTTC AGGATC 26 



2) INFORMATION FOR SEQ ID NO: 1442 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 927 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 

(B) STRAIN: RNL-1 

. (C) ACCESSION NUMBER: Z21957 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1442 

ATGAATGTCA TTATAAAAGC TGTAGTTACT GCCTCGACGC TACTGATGGT 50 

ATCTTTTAGT TCATTCGAAA CCTCAGCGCA ATCCCCACTG TTAAAAGAGC 100 

AAATTGAATC CATAGTCATT GGAAAAAAAG CCACTGTAGG CGTTGCAGTG 150 

TGGGGGCCTG ACGATCTGGA ACCTTTACTG ATTAATCCTT TTGAAAAATT 200 

CCCAATGCAA AGTGTATTTA AATTGCATTT AGCTATGTTG GTACTGCATC 250 

AGGTTGATCA GGGAAAGTTG GATTTAAATC AGACCGTTAT CGTAAACAGG 300 

GCTAAGGTTT TACAGAATAC CTGGGCTCCG ATAATGAAAG CGTATCAGGG 350 

AGACGAGTTT AGTGTTCCAG TGCAGCAACT GCTGCAATAC TCGGTCTCGC 400 

ACAGCGATAA CGTGGCCTGT GATTTGTTAT TTGAACTGGT TGGTGGACCA 4 50 

GCTGCTTTGC ATGACTATAT CCAGTCTATG GGTATAAAGG AGACCGCTGT 500 

GGTCGCAAAT GAAGCGCAGA TGCACGCCGA TGATCAGGTG CAGTATCAAA 550 

ACTGGACCTC GATGAAAGGT GCTGCAGAGA TCCTGAAAAA GTTTGAGCAA 600 

AAAACACAGC TGTCTGAAAC CTCGCAGGCT TTGTTATGGA AGTGGATGGT 650 

CGAAACCACC ACAGGACCAG AGCGGTTAAA AGGTTTGTTA CCAGCTGGTA 700 

CTGTGGTCGC ACATAAAACT GGTACTTCGG GTATCAAAGC CGGAAAAACT 750 

GCGGCCACTA ATGATTTAGG TATCATTCTG TTGCCTGATG GACGGCCCTT 800 

GCTGGTTGCT GTTTTTGTGA AAGACTCAGC CGAGTCAAGC CGAACCAATG 850 

AAGCTATCAT TGCGCAGGTT GCTCAGACTG CGTATCAATT TGAATTGAAA 900 

AAGCTTTCTG CCCTAAGCCC AAATTAA 927 



2) INFORMATION FOR SEQ ID NO: 1443 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1443 
CTTCTGCTCT GCTGATGCTT GGC 
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2) INFORMATION FOR SEQ ID NO: 1444 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1444 
GGCGACCAGG TATTTTGTAA TACTGC 26 



2) INFORMATION FOR SEQ ID NO: 1445 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 927 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella typhimurium 

(B) STRAIN: JMC 

30 (C) ACCESSION NUMBER: X93314 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1445 

ATGAATGTCA TCACAAAATG TGTTTTCACC GCTTCTGCTC TGCTGATGCT 50 

35 TGGCTTAAGT TCATTTGTAG TATCAGCCCA ATCCCCTTTG TTAAAAGAGC 100 

AGATTGAAAC CATAGTGACG GGTAAAAAGG CCACTGTAGG TGTAGCAGTG 150 

TGGGGGCCTG ACGATCTGGA ACCTTTGTTG CTGAATCCAT TTGAAAAGTT 200 

TCCGATGCAA AGTGTGTTTA AACTGCATTT AGCTATGTTA GTTCTGCATC 250 

AGGTCGATCA GGGGAAACTG GATTTAAATC AGTCTGTTAC TGTTAATCGT 300 

40 GCTGCAGTAT TACAAAATAC CTGGTCGCCA ATGATGAAAG ATCATCAGGG 350 

CGATGAATTT ACTGTTGCAG TACAGCAGTT ACTGCAGTAT TCGGTGTCAC 4 00 

ACAGCGACAA TGTGGCCTGC GATTTGTTAT TTGAACTGGT GGGCGGGCCG 4 50 

CAAGCTTTGC ATGCTTATAT CCAGTCTTTA GGCGTTAAAG AAGCTGCCGT 500 

GGTAGCAAAT GAAGCGCAAA TGCATGCGGA TGATCAGGTG CAATATCAAA 550 

45 ACTGGACGTC GATGAAAGCC GCAGCACAAG TTCTGCAAAA GTTTGAACAG 600 

AAAAAGCAGT TGTCTGAAAC CTCTCAGGCC TTGTTATGGA AATGGATGGT 650 

TGAAACCACC ACAGGACCAC AGCGGTTAAA AGGCTTGTTA CCTGCTGGTA 700 

CTATAGTGGC GCATAAAACC GGTACTTCGG GCGTCAGAGC AGGAAAAACT 750 

GCGGCGACTA ATGATGCGGG CGTCATTATG TTGCCTGATG GACGGCCTTT 800 

50 ATTGGTGGCG GTATTTGTCA AGGATTCGGC TGAATCAGAA CGAACCAATG 8 50 

AAGCTATTAT TGCGCAGGTT GCGCAAGCGG CTTATCAGTT TGAGCTGAAA 900 

AAACTCTCTG CAGTGAGTCC GGATTGA 927 
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2) INFORMATION FOR SEQ ID NO: 1446 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1446 
GGCCTGYGAT TTGTTATTTG AACTGGT 



2) INFORMATION FOR SEQ ID NO: 1447 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1447 
CGCTSTGGTC CTGTGGTGGT TTC 



2) INFORMATION FOR SEQ ID NO: 1448 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1448 
GATCAGGTGC ARTATCAAAA CTGGAC 



2) INFORMATION FOR SEQ ID NO: 1449 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1449 
AGCWGGTAAC AAYCCTTTTA ACCGCT 



2) INFORMATION FOR SEQ ID NO: 14 50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1450 
ACCACTGGGA ATACACTTGT AATGGC 



2) INFORMATION FOR SEQ ID NO: 1451 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1451 
ATCTACCTGG TCAATCATTG CTTCGT 



2) INFORMATION FOR SEQ ID NO: 1452 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: BM10393 

(C) ACCESSION NUMBER: AF045472 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1452 

ATGACATTAT CAATAATTGT CGCTCACGAT AAACAAAGAG TCATTGGGTA 50 

CCAAAATCAA TTACCTTGGC ACTTACCAAA TGATTTAAAG CATATTAAAC 100 

AACTGACCAC TGGGAATACA CTTGTAATGG CACGGAAAAC TTTTAATTCT 150 

ATAGGGAAGC CATTGCCAAA TAGACGTAAC GTCGTACTCA CTAACCAAGC 200 

TTCATTTCAC CATGAAGGGG TAGATGTTAT AAACTCTCTT GATGAAATTA 250 

AAGAGTTATC TGGTCATGTT TTTATATTTG GAGGACAAAC GTTATACGAA 300 

GCAATGATTG ACCAGGTAGA TGATATGTAT ATCACAGTAA TAGATGGAAA 350 

GTTTCAAGGA GACACATTCT TTCCACCATA CACATTCGAA AACTGGGAAG 400 

TCGAATCTTC AGTAGAAGGT CAACTAGATG AAAAAAATAC TATACCGCAT 4 50 

ACATTCTTAC ATTTAGTGCG TAGAAAAGGG AAATAG 4 86 



2) INFORMATION FOR SEQ ID NO: 1453 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1453 
ATCGAAGAAT GGAGTTATCG GRAATG 



2) INFORMATION FOR SEQ ID NO: 1454 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1454 
CCTAAAAYTR 'CTGGGGATTT CWGGA 



2) INFORMATION FOR SEQ ID NO: 1455 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1455 
CAGGTGGTGG GGAGATATAC AAAA 



2) INFORMATION FOR SEQ ID NO: 1456 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1456 
TATGTTAGAS RCGAAGTCTT GGKTAA 



2) INFORMATION FOR SEQ ID NO: 1457 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1457 
CAAAGGTGAA CAGCTCCTGT TT 



2) INFORMATION FOR SEQ ID NO: 1458 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1458 
TCCGTTATTT TCTTTAGGTT GGTTAAA 

2) INFORMATION FOR SEQ ID NO: 1459 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1459 
AAGGTGAACA GCTCCTGTTT 



2) INFORMATION FOR SEQ ID NO: 1460 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 60 
GATCACTACG TTCTCATTGT CA 



2) INFORMATION FOR SEQ ID NO: 1461 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: AJ238350 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 61 



5 



GTGAAACTAT CACTAATGGT AGCTATATCG AAGAATGGAG TTATCGGGAA 
TGGCCCTGAT ATTCCATGGA GTGCCAAAGG TGAACAGCTC CTGTTTAAAG 
CTATTACCTA TAACCAATGG CTGTTGGTTG GACGCAAGAC TTTTGAATCA 
ATGGGAGCAT TACCCAACCG AAAGTATGCG GTCGTAACAC GTCCAAGTTT 

10 TACATCTGAC AATGAGAACG TAGTGATCTT TCCATCAATT AAAGATGCTT 
TAACCAACCT AAAGAAAATA ACGGATCATG TCATTGTTTC AGGTGGTGGG 
GAGATATACA AAAGCCTGAT CGATCAAGTA GATACACTAC ATATATCTAC 
AATAGACATC GAGCCGGAAG GTGATGTTTA CTTTCCTGAA ATCCCCAGCA 
ATTTTAGGCC AGTTTTTACC CAAGACTTCG CCTCTAACAT AAATTATAGT 

15 TACCAAATCT GGCAAAAGGG TTAA 



2) INFORMATION FOR SEQ ID NO: 1462 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1462 

30 

GCACTCCCYA ATAGGAAATA CGC 



35 2) INFORMATION FOR SEQ ID NO: 1463 



50 
100 
150 
200 
250 
300 
350 
400 
450 
474 



20 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



25 



(i) 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



40 



(ii) 



MOLECULE TYPE: DNA 



45 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 1463 



AGTGTTGCTC AAAAACAACT TCG 



50 



2) INFORMATION FOR SEQ ID NO: 1464 



(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

5 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1464 
10 ACGTTYGAAT CTATGGGMGC ACT 23 



15 



45 



2) INFORMATION FOR SEQ ID NO: 1465 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
2 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1465 

25 

GTCGATAAGT GGAGCGTAGA GGC 23 



30 2) INFORMATION FOR SEQ ID NO: 1466 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1466 

AAGCATTGAC CTACAATCAG TGT 23 



2) INFORMATION FOR SEQ ID NO: 1467 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 27 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1467 
5 AATACAACTA CATTGTCATC ATTTGAT 27 



2) INFORMATION FOR SEQ ID NO: 1468 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 68 

20 

CGTTACCCGC TCAGGTTGGA CATCAA 26 



25 2) INFORMATION FOR SEQ ID NO: 1469 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1469 

CATCCCCCTC TGGCTCGATG TCG 23 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1470 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 474 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: Z50804 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1470 

TTGAAAGTAT CATTGATAGC TGCGAAACGA AAAAACGGCG TGATTGGTTG 50 

5 CGGTCCAGAC ATACCGTGGT CCGCGAAAGG GGAGCAGCTA CTTTTTAAAG 100 

CATTGACCTA CAATCAGTGT CTTCTGGTGG GTCGCAAGAC GTTTGAATCT 150 

ATGGGCGCAC TCCCCAATAG GAAATACGCG GTCGTTACCC GCTCAGGTTG 200 

GACATCAAAT GATGACAATG TAGTTGTATT TCAGTCAATC GAAGAGGCCA 250 

TGGACAGGCT AGCTGAATTC ACCGGTCACG TTATAGTGTC TGGTGGCGGA 300 

10 GAAATTTACC GAGAAACATT ACCCATGGCC TCTACGCTCC ACTTATCGAC 350 

GATCGACATC GAGCCAGAGG GGGATGTTTT CTTCCCGAGT ATTCCAAATA 400 

CCTTCGAAGT TGTTTTTGAG CAACACTTTA CTTCAAACAT TAACTATTGC 4 50 

TATCAAATTT GGAAAAAGGG TTAA 474 



15 



2) INFORMATION FOR SEQ ID NO: 1471 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1471 
GATAATGACA ACGTAATAGT ATTCCC 26 

30 



2) INFORMATION FOR SEQ ID NO: 1472 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1472 
45 GCTCAATATC AATCGTCGAT ATA 23 



2) INFORMATION FOR SEQ ID NO: 1473 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

( D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1473 
TTAAAGCCTT GACGTACAAC CAGTGG 26 



2) INFORMATION FOR SEQ ID NO: 1474 



<i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20. (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1474 
TGGGCAATGT TTCTCTGTAA ATCTCC 26 

25 



2) INFORMATION FOR SEQ ID NO: 1475 

3 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
40 (C) ACCESSION NUMBER: X12868 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1475 

GTGAAAGTAT CATTAATGGC TGCAAAAGCG AAAAACGGAG TGATTGGTTG 50 

45 CGGTCCACAC ATACCCTGGT CCGCGAAAGG AGAGCAGCTA CTCTTTAAAG 100 

CCTTGACGTA CAACCAGTGG CTTTTGGTGG GCCGCAAGAC GTTCGAATCT 150 

ATGGGAGCAC TCCCTAATAG GAAATACGCG GTCGTTACTC GCTCAGCCTG 200 

GACGGCCGAT AATGACAACG TAATAGTATT CCCGTCGATC GAAGAGGCCA 250 

TGTACGGGCT GGCTGAACTC ACCGATCACG TTATAGTGTC TGGTGGCGGG 300 

50 GAGATTTACA GAGAAACATT GCCCATGGCC TCTACGCTCC ATATATCGAC 350 

GATTGATATT GAGCCGGAAG GAGATGTTTT CTTTCCGAAT ATTCCCAATA 400 

CCTTCGAAGT TGTTTTTGAG CAACACTTTA GCTCAAACAT TAACTATTGC 4 50 

TATCAAATTT GGCAAAAGGG TTAA 474 
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2) INFORMATION FOR SEQ ID NO: 1476 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1476 
GGCGAGCAGC TCCTATTCAA AG 



2) INFORMATION FOR SEQ ID NO: 1477 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1477 
TAGGTAAGCT AATGCCGATT CAACA 



2) INFORMATION FOR SEQ ID NO: 1478 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1478 
GAGAATGGAG TAATTGGCTC TGGATT 



2) INFORMATION FOR SEQ ID NO: 1479 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
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10 



40 



(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1479 
GCGAAATACA CAACATCAGG GTCAT 25 



2) INFORMATION FOR SEQ ID NO: 1480 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 474 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabills 

(B) STRAIN: J120 

25 (C) ACCESSION NUMBER: Z86002 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1480 

ATGAAAATAT CTCTTATGGC AGCTGTTTCC GAGAATGGAG TAATTGGCTC 50 

3 0 TGGATTGGAT ATACCTTGGC ATGTACAAGG CGAGCAGCTC CTATTCAAAG 100 

CCATGACTTA CAATCAATGG CTTCTAGTTG GTCGTAAAAC CTTCGACTCA 150 

ATGGGTAAAC TTCCGAATAG AAAATATGCA GTGGTTACTC GTTCTAAAAT 200 

TATCTCGAAT GACCCTGATG TTGTGTATTT CGCAAGTGTT GAATCGGCAT 250 

TAGCTTACCT AAACAATGCG ACAGCACATA TCTTTGTTTC TGGTGGTGGT 300 

35 GAAATATATA AAGCTTTAAT CGATCAAGCA GATGTTATCC ATCTTTCAGT 350 

GATTCACAAG CATATCTCTG GCGATGTGTT TTTTCCTCCA GTTCCACAGG 400 

GCTTCAAGCA AACATTTGAG CAAAGTTTCA GTTCAAATAT TGATTACACG 450 

TACCAAATTT GGGCAAAGGG CTAA 4 74 



2) INFORMATION FOR SEQ ID NO: 1481 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1481 
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24 



5 2) INFORMATION FOR SEQ ID NO: 1482 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1482 

TAATTTATAT TAGACAWAAA AAACTG 26 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1483 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1483 
CARYGTCAGA AAATGGCGTA ATC 23 



2) INFORMATION FOR SEQ ID NO: 1484 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1484 
TKCAAAGCRW TTTCTATTGA AGGAAA 26 

50 



2) INFORMATION FOR SEQ ID NO: 1485 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1485 
AAAATGGCGT AATCGGTAAT GGC 



2) INFORMATION FOR SEQ ID NO: 1486 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1486 
CATTTGAGCT TGAAATTCCT TTCCTC 



2) INFORMATION FOR SEQ ID NO: 1487 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1487 
AATCGAAAAT ATGCAGTAGT GTCGAG 



2) INFORMATION FOR SEQ ID NO: 1488 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1488 
AGACTATTGT AGATTTGACC GCCA 



2) INFORMATION FOR SEQ ID NO: 1489 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: VA292 

(C) ACCESSION NUMBER: U31119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1489 

TTGAAAATTT CATTGATTTC TGCAACGTCA GAAAATGGCG TAATCGGTAA 50 

TGGCCCTGAT ATCCCATGGT CAGCAAAAGG TGAGCAGTTA CTCTTTAAAG 100 

CGCTCACATA TAATCAGTGG CTCCTTGTTG GAAGGAAAAC ATTTGACTCT 150 

ATGGGTGTTC TTCCAAATCG AAAATATGCA GTAGTGTCGA GGAAAGGAAT 200 

TTCAAGCTCA AATGAAAATG TATTAGTCTT TCCTTCAATA GAAATCGCTT 250 

TGCAAGAACT ATCGAAAATT ACAGATCATT TATATGTCTC TGGTGGCGGT 300 

CAAATCTACA ATAGTCTTAT TGAAAAAGCA GATATAATTC ATTTGTCTAC 350 

TGTTCACGTT GAGGTTGAAG GTGATATCAA TTTTCCTAAA ATTCCAGAGA 400 

ATTTCAATTT GGTTTTTGAG CAGTTTTTTT TGTCTAATAT AAATTACACA 4 50 

TATCAGATTT GGAAAAAAGG CTAA 474 



2) INFORMATION FOR SEQ ID NO: 1490 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1490 
GACCTATGAG AGCTTGCCCG TCAAA 
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2) INFORMATION FOR SEQ ID NO: 14 91 



PCT/CAOO/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1491 
TCGCCTTCGT ACAGTCGCTT AACAAA 



2) INFORMATION FOR SEQ ID NO: 1492 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1492 
CATTTTAGCT GCCACCGCCA ATGGTT 



2) INFORMATION FOR SEQ ID NO: 1493 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1493 
GCGTCGCTGA CGTTGTTCAC GAAGA 



2) INFORMATION FOR SEQ ID NO: 1494 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 510 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: BL26A 

(C) ACCESSION NUMBER: U10186 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1494 

ATGATCGAGC TTCATGCCAT TTTAGCTGCC ACCGCCAATG GTTGCATTGG 50 

GAAGGACAAC GCACTTCCCT GGCCACCACT AAAAGGCGAT CTGGCCAGAT 100 

TCAAAAAATT GACCATGGGG AAGGTGGTCA TTATGGGGCG CAAGACCTAT 150 

GAGAGCTTGC CCGTCAAATT AGAAGGTCGC ACCTGCATCG TTATGACGCG 200 

CCAAGCGCTG GAGCTTCCGG GTGTTCGTGA CGCTAACGGC GCTATCTTCG 250 

TGAACAACGT CAGCGACGCC ATGCGGTTCG CTCAAGAAGA GAGCGTGGGC 300 

GATGTGGCCT ACGTCATTGG TGGCGCTGAG ATATTCAAGC GACTTGCCTT 350 

GATGATCACG CAGATTGAAT TGACCTTTGT TAAGCGACTG TACGAAGGCG 400 

ACACCTACGT TGATCTGGCC GAAATGGTCA AAGACTACGA GCAGAATGGC 450 

ATGGAAGAAC ATGACCTTCA CACTTACTTC ACTTACCGTA AAAAGGAGCT 500 

TACAGAATGA 510 



2) INFORMATION FOR SEQ ID NO: 1495 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1495 
TCTCTAAACA TGATTGTCGC TGTC 



2) INFORMATION FOR SEQ ID NO: 1496 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1496 
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CAGTGAGGCA AAAGTTTTTC TACC 



2) INFORMATION FOR SEQ ID NO: 1497 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1497 
CGGACGACTT CATGTGGTAG TCAGT 



2) INFORMATION FOR SEQ ID NO: 1498 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1498 
TTTGTTTTCA GTAATGGTCG GGACCT 



2) INFORMATION FOR SEQ ID NO: 1499 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: X57730 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:" 1499 
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ATGGCTTCTC TAAACATGAT 
ATTTGAAAAT CAGATTCCGT 
AAGCTGTTAC AATGAACTCA 
TCACTGCCTA AAGTGCTGCC 
5 AGTACCACCC ACCCAGAACA 
AGATCGCAGT AAGAACTGCA 
CAAATTTTTG TAATTGGTGG 
CGTGGACAAA CTCTACTTAA 
CTGAACTGGA TTTATCCCTA 
10 CCGACCATTA CTGAAAACAA 
CCCTAACCCT ATTAGTGAGG 



TGTCGCTGTC AATAAGACAG 
GGCATGAACC AGAAGATTTA 
GTTTTGATTA TGGGTAGAAA 
CGGACGACTT CATGTGGTAG 
CTGATCAAGT TGTGTATGTA 
AGCTTGTTGG TTGACAAACC 
GAAGAGTGCG TACGAGAACT 
CTAGAGTACA GCTCAACACA 
TTCAAGTCAT GGAAACTCGT 
AACAAAACTT ATTTTCCAAA 
AACCCACATG TTAG 



PCT/CA00/01150 

GAGGTATCGG 50 

AAACACTTCA 100 

AACTTTTGCC 150 

TCAGTAAAAC 200 

AGTACATACC 250 

AGAGTATTCT 300 

TAGCTGCCTA 350 

CAACAAGACA 400 

ATCTGAGGTC 450 

TTTGGATTAA 500 

534 



15 2) INFORMATION FOR SEQ ID NO: 1500 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

* 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1500 

ATCGGGTTAT TGGCAATGGT CCTA 24 



30 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1501 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1501 
GCGGTAGTTA GCTTGGCGTG AGATT 25 



2) INFORMATION FOR SEQ ID NO: 1502 



(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1502 

5 

GCGGGCGGAG CTGAGATATA CA 22 



10 2) INFORMATION FOR SEQ ID NO: 1503 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1503 

AACGGAGTGG GTGTACGGAA TTACAG 26 



25 



2) INFORMATION FOR SEQ ID NO: 1504 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 498 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

35 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: TKS84 

(C) ACCESSION NUMBER: Z21672 

40 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1504 



ATGAACTCGG AATCAGTACG CATTTATCTC 
TCGGGTTATT GGCAATGGTC CTAATATCCC 

4 5 AGAAGATTTT TCGCAGACTC ACTGAGGGAA 
AAGACCTTTG AGTCTATCGG CAAGCCTCTA 
AATCTCACGC CAAGCTAACT ACCGCGCCAC 
CGCTGTCGCA CGCTATCGCT TTGGCATCCG 
GTCGCGGGCG GAGCTGAGAT ATACACTCTG 

50 CGTGTTTCTA TCTGAGGTAC ATCAAACCTT 
CAATGCTCAA CGAAACAGAA TTCGAGCTTG 
GCTGTAATTC CGTACACCCA CTCCGTTTAT 



GTTGCTGCGA TGGGAGCCAA 50 

CTGGAAAATT CCGGGTGAGC 100 

AAGTCGTTGT CATGGGGCGA 150 

CCGAACCGTC ACACATTGGT 200 

TGGCTGCGTA GTTGTTTCAA 250 

AACTCGGCAA TGAACTCTAC 300 

GCACTACCTC ACGCCCACGG 350 

CGAGGGTGAC GCCTTCTTCC 4 00 

TCTCAACCGA AACCATTCAA 4 50 

GCGCGTCGAA ACGGCTAA 4 98 
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2) INFORMATION FOR SEQ ID NO: 1505 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1505 
ATTTTTCGCA GGCTCACCGA GAGC 



2) INFORMATION FOR SEQ ID NO: 1506 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1506 
CGGATGAGAC AACCTCGAAT TCTGCTG 



2) INFORMATION FOR SEQ ID NO: 1507 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 498 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN : RA33.2 

(C) ACCESSION NUMBER: Z50802 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1507 

ATGAACCCGG AATCGGTCCG CATTTATCTG GTCGCTGCCA TGGGTGCCAA 50 

TCGGGTTATT GGCAATGGTC CCGATATCCC CTGGAAAATC CCAGGTGAGC 100 

AGAAGATTTT TCGCAGGCTC ACCGAGAGCA AAGTGGTCGT TATGGGCCGC 150 

AAGACATTTG AGTCCATAGG CAAGCCCTTA CCAAACCGCC ACACAGTGGT 200 
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GCTCTCGCGC CAAGCTGGTT ATAGCGCTCC 
CGCTGTCACA CGTATCGCCA TCGACAGCCG 
GTAGCGCGCG GAGCCGAGGT ATATGCGCTG 
CGTCTTTCTA TCTGAGGTAC ATCAAACCTT 
5 CAGTGCTTAA CGCAGCAGAA TTCGAGGTTG 
GGCACAATCA CGTACACGCA CTCCGTCTAT 



PCT/CA00/01150 

TGGTTGTGCA GTTGTTTCAA 250 

AACACGGCAA AGAACTCTAC 300 

GCGCTACCGC ATGCCAACGG 350 

TGAGGGTGAC GCCTTCTTCC 400 

TCTCATCCGA AACCATTCAA 450 

GCGCGTCGTA ACGGCTAA 4 98 



10 2) INFORMATION FOR SEQ ID NO: 1508 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1508 

AGAATGTATT GGTATTTCCA TCTATCG 27 



25 



35 



40 



2) INFORMATION FOR SEQ ID NO: 1509 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1509 
CAATGTCGAT TGTTGAAATA TGTAAA 26 



2) INFORMATION FOR SEQ ID NO: 1510 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1510 
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30- 



35 



50 
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TGGAGTGCCA AAGGGGAACA AT 



5 2) INFORMATION FOR SEQ ID NO: 1511 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1511 

CAGACACAAT CACATGATCC GTTATCG 27 



2) INFORMATION FOR SEQ ID NO: 1512 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 474 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: UI14 

(C) ACCESSION NUMBER: Z83331 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1512 



GTGAAACTAT CACTAATGGC AGCAATTTCG AAGAATGGAG TTATCGGAAA 50 

TGGCCCAGAT ATTCCATGGA GTGCCAAAGG GGAACAATTA CTCTTCAAAG 100 

40 CGATTACCTA TAATCAGTGG CTTTTGGTAG GCCGAAAGAC TTTCGAGTCA 150 

ATGGGGGCTT TACCCAACCG AAAATATGCC GTTGTAACTC GTTCAAGCTT 200 

CACTTCCAGT GATGAGAATG TATTGGTATT TCCATCTATC GATGAAGCGC 250 

TAAATCATCT GAAGACGATA ACGGATCATG TGATTGTGTC TGGTGGTGGT 300 

GAAATATACA AAAGCCTGAT CGATAAAGTT GATACTTTAC ATATTTCAAC 350 

4 5 AATCGACATT GAGCCAGAAG GTGATGTCTA TTTTCCAGAA ATCCCCAGTA 400 

GTTTTAGGCC AGTTTTTAGC CAAGACTTCG TGTCTAACAT AAATTATAGT 4 50 

TACCAAATCT GGCAAAAGGG TTAA 4 74 



2) INFORMATION FOR SEQ ID NO: 1513 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1513 
TTCAAGCTCA AATGAAAACG TCC 



2) INFORMATION FOR SEQ ID NO: 1514 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1514 
GAAATTCTCA GGCATTATAG GGAAT 



2) INFORMATION FOR SEQ ID NO: 1515 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1515 
GTGGTCAGTA AAAGGTGAGC AAC 



2) INFORMATION FOR SEQ ID NO: 1516 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear ' 
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CTATGTCTCA AGGCCGTGCA ACATACTCTA TGGAATTTGC TAAATATGCT 
GAAACTCCAC GTAACGTGGC TGAAGGCATC ATTTCTAAAT TTCAGTCTGG 
CGGTAAAAAA GGTGACGACG AGTAA 



50 
100 
125 



2) INFORMATION FOR SEQ ID NO: 1519 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter lwoffi 

(B) STRAIN: CDCF 3697 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1519 

TCTTTCGATT ACTATAAGCC CAAACTAATT CATAGTTAAA AACCAAGTGC 50 
TCATGCAGTG ATCCTGCATG AGTAGTTTAA AAAGGAAGAT CTC 93 



2) INFORMATION FOR SEQ ID NO: 1520 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1106 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter lwoffi 

(B) STRAIN: CDCF 3697 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1520 

ATGGCTAAGG CTAAGTTTGA ACGTAATAAG CCACACGTTA ACGTGGGCAC 50 

AATCGGTCAC GTTGACCATG GTAAAACAAC TTTAACAGCT GCAATTGCAA 100 

CTGTATGTGC GAAGAAATTC GGTGGCGAAG CGAAAGACTA CGCTGCAATT 150 

GACTCTGCAC CAGAAGAAAA AGCACGTGGT ATTACAATTA ATACTTCACA 200 

CGTAGAATAC GATTCTCCAA CTCGTCACTA CGCACACGTA GACTGCCCGG 250 

GCCACGCCGA TTATGTTAAA AACATGATTA CTGGTGCTGC TCAGATGGAC 300 

GGCGCGATCC TTGTATGTGC TGCGACTGAT GGTCCAATGC CACAGACTCG 350 

TGAACACATC CTTCTTTCTC GTCAGGTTGG TGTACCTTAC ATTCTTGTAT 4 00 

TCCTTAACAA GTGTGACCTT GTTGATGATG AAGAACTTCT TGAGCTAGTG 450 
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GAAATGGAAG TTCGTGAACT TCTTTCTACT 
CACTCCAGTT ATCCGTGGTT CAGCTCTTCT 
GTCAGTATGG CGAAGAAGCA GTTGTTGCGC 
TACATTCCAG AGCCAGTACG TGCAATCGAC 
CGAAGACGTA TTCTCTATTT CTGGTCGTGG 
TAGAAACTGG TATTGTGAAA GTAGGCGAAT 
CGTGATACTC AAGTAACTAC AGTTACTGGC 
GCTTGACGAA GGTCGTGCGG GCGAGAACTG 
CTAAGCGTGA AGACGTACAA CGTGGTCAAG 
ATCAAGCCAC ACACTAAATT CGATGCAGAA 
AGAAGGTGGT CGTCACACTC CATTCCTTAA 
ACTTCCGTAC AACTGACGTA ACTGGCGCGA 
GAAATGGTTA TGCCTGGTGA CAACGTAGAA 
CCCAAT 



PCT/CA00/01150 

TATGACTTCC CAGGTGATGA 500 

TGCACTTAAC GGTGACGCTG 550 

TTGTTGACGC ACTTGACACT 600 

CAAGCATTCT TAATGCCAAT 650 

TACAGTAGTA ACTGGCCGTG 700 

CAGTTGAAAT CGTTGGTATC 750 

GTAGAAATGT TCCGTAAATT 800 

TGGTGTTCTT CTACGTGGTA 850 

TACTTGCTAA ACCAGGTGCA 900 

GTATACGTAC TTTCTAAAGA 950 

CGGTTACCGT CCACAGTTCT 1000 

TCAAATTACA AGATGGCGTT 1050 

ATGTCAGTAG AATTAATCCA 1100 

1106 



2) INFORMATION FOR SEQ ID NO: 1521 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus influenzae 

(B) STRAIN: ATCC 9006 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1521 

ACAAACTCAA GGTCGTGCAT CTTACTCAAT GGAACCGTTA AAATATGCTG 50 
AAGCTCCAAC AAGTGTTGCG GCTGCAGTAA TTGAAGCGCG TAAAAAATAA 100 



2) INFORMATION FOR SEQ ID NO: 1522 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus influenzae 

(B) STRAIN: ATCC 9006 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1522 
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TTTTTGTAAA CCAGCGGTGT AAAATATGAT TGTTTTATAC CGCACTTCTT 50 
AGGAAACATT AGAA 64 



2) INFORMATION FOR SEQ ID NO: 1523 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1098 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Haemophilus influenzae 

(B) STRAIN: ATCC 9006 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1523 

ATGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTAA ACGTGGGTAC 50 

AATCGGCCAC GTTGACCACG GTAAAACAAC TTTAACAGCA GCAATTACAA 100 

CCGTATTAGC AAAACACTAC GGTGGTGCAG CGCGTGCATT TGACCAAATC 150 

GATAACGCGC CAGAAGAAAA AGCGCGTGGT ATTACCATCA ACACTTCACA 200 

TGTTGAATAC GATACACCAA CTCGCCACTA TGCACACGTA GACTGTCCAG 250 

GACACGCCGA CTATGTTAAA AACATGATTA CCGGTGCGGC GCAAATGGAT 300 

GGTGCTATTT TAGTAGTAGC AGCAACAGAT GGTCCTATGC CACAAACTCG 350 

TGAACATATC TTATTAGGTC GCCAAGTAGG TGTTCCATAC ATCATCGTAT 4 00 

TCTTAAACAA ATGCGACATG GTAGATGATG AAGAGTTATT AGAATTAGTA 4 50 

GAAATGGAAG TGCGTGAACT TCTATCTCAA TATGACTTCC CAGGTGACGA 500 

TACACCAATC GTACGTGGTT CAGCATTACA AGCATTGAAC GGCGTAGCAG 550 

AATGGGAAGA AAAAATCCTT GAATTAGCTG GTCACTTAGA TACTTACATC 600 

CCAGAACCAG AACGTGCGAT TGACCAACCG TTCCTTCTTC CAATTGAAGA 650 

CGTATTCTCA ATTTCAGGTC GTGGTACAGT AGTAACTGGT CGTGTAGAAC 700 

GTGGTATCAT CCGTACTGGT GATGAAGTAG AAATCGTCGG TATCAAAGAT 750 

ACAGCGAAAA CTACTGTAAC AGGTGTTGAA ATGTTCCGTA AATTACTTGA 8 00 

CGAAGGTCGT GCAGGTGAAA ACATCGGTGC ATTATTACGT GGTACCAAAC 850 

GTGAAGAAAT CGAACGTGGT CAAGTATTAG CGAAACCAGG TTCAATCACA 900 

CCACACACTG ATTTTGAATC AGAAGTATAC GTATTATCAA AAGATGAAGG 950 

TGGTCGTCAT ACTCCATTCT TCAAAGGTTA CCGTCCACAA TTCTATTTCC 1000 

GTACAACAGA CGTAACTGGT ACAATTGAAT TACCAGAAGG CGTGGAAATG 1050 

GTAATGCCAG GCGATAACAT CAAGATGACA GTAAGCTTAA TCCACCCA 1098 



2) INFORMATION FOR SEQ ID NO: 1524 

(i) SEQUENCE CHARACTERISTICS:; 

(A) LENGTH: 77 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 25933 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1524 

CAATGGAGTT CTTGAAGTAC AACGAAGCGC CTAGCAACGT CGCTCAGGCT 50 
ATTATCGAAG CTCGTAAAGC GAAATAA 77 



2) INFORMATION FOR SEQ ID NO: 1525 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 67 bases 
20 (B) TYPE: Nucleic acid 

* (C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 25933 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1525 

GATCCTTTCG AGTTCAATTT AGTTTACGCT CCCTCTGTGA GAGGGAGCGA 50 
TATTAAGGAA TATAGTC 67 



2) INFORMATION FOR SEQ ID NO: 1526 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 1112 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 25933 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1526 

GTGTCTAAAG AAAAATTTGA ACGTTCAAAA CCGCACGTTA ACGTTGGTAC 50 
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TATCGGCCAC GTTGACCACG GTAAAACAAC 
CAGTTTTAGC TAAAACTTAC GGTGGTGCTG 
GATAATGCAC CAGAAGAAAA AGCGCGTGGT 
CGTAGAATAC GATACTCCAA CTCGCCACTA 
5 GTCACGCCGA CTATGTTAAA AACATGATCA 
GGCGCTATTC TGGTAGTAGC AGCAACTGAT 
TGAGCACATC CTGTTAGGTC GTCAGGTTGG 
TCCTGAACAA ATGTGACATG GTAGATGATG 
GAAATGGAAG TTCGTGAACT TCTGTCTCAA 

10 CACTCCAGTA ATCCGTGGTT CAGCGCTGAA 
AGTGGGAAGC AAAAATTGTT GAATTAGCAG 
CCAGAGCCAG AGCGTGCAAT TGACAAACCA 
TGTATTCTCA ATCTCAGGCC GTGGTACAGT 
GTGGTATCAT CAAAGTAGGT GATGAAGTTG 

15 ACCACCAAAA CAACTTGTAC TGGCGTTGAA 
CGAAGGTCGT GCAGGTGAGA ACGTAGGTGT 
GTGAAGAAAT CGAACGTGGA CAAGTACTGG 
CCACACAACA AATTTGAATC AGAAGTTTAT 
TGGTCGTCAC ACACCATTCT TCAAAGGCTA 

20 GTACAACTGA CGTAACTGGT ACTATCGAAT 
GTAATGCCAG GCGACAACGT GAACATGATC 
CGCAATGGAC GA 



PCT/CA00/01150 

TCTGACTGCT GCAATCACTA 100 

CTCGTGCATT CGACCAAATC 150 

ATCACCATCT CTACTTCACA 200 

CGCACACGTA GACTGCCCAG 250 

CTGGTGCTGC GCAAATGGAC 300 

GGTCCAATGC CACAAACTCG 350 

TGTTCCTTAC ATCATCGTAT 400 

AAGAGCTGTT AGAATTAGTT 450 

TACGATTTCC CAGGTGATGA 500 

AGCACTGGAA GGCGAAGCAG 550 

AAGCACTGGA TTCTTATATC 600 

TTCCTGTTAC CAATCGAAGA 650 

AGTTACTGGT CGTGTAGAGC 700 

AGATTGTTGG TATCAAAGAA 750 

ATGTTCCGTA AATTACTTGA 800 

TCTGCTGCGT GGTACAAAAC 850 

CAAAACCAGG CTCAATCAAC 900 

ATTCTGAGCA AAGATGAAGG 950 

CCGTCCACAG TTCTACTTCC 1000 

TACCAGAAGG CGTAGAAATG 1050 

GTTGAACTGA TCCACCCAAT 1100 

1112 



25 



35 



2) INFORMATION FOR SEQ ID NO: 1527 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 800 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter curvus 

(B) STRAIN: ATCC 35224 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1527 

ATCAACGAAG CTATCGAGGT TTATTTTGAG GTTGAGGGCA AGAAAAATAG 50 

ATTGATCCTG GAGGTCGCGG CTCACTTGGG TGATAACCGC GTCAGAACGA 100 

TCGCTATGGA TATGAGTGAG GGGCTTACTC GCGGGCTTGA AGCTACCGCT 150 

45 CTTGGTGCGC CTATTAGTGT GCCGGTTGGC GAGAAGGTTT TGGGAAGAAT 200 

TTTTAACGTC GTCGGCGATC TCATCGACGA GGGCGAGGGC GTAAATTTTG 250 

ATAAACATTG GTCTATCCAC CGCGATCCGC CACCATTTGA AGAACAAAGC 300 

ACGAAAAGTG AAATTTTTGA AACCGGTATA AAGGTTGTGG ATCTTCTTGC 350 

GCCTTACGCA AAGGGCGGTA AGGTCGGACT ATTTGGCGGT GCAGGTGTCG 400 

50 GTAAAACGGT CATCATCATG GAGCTCATCC ACAATGTCGC CTTCAAACAC 450 

AGCGGATACT CTGTATTTGC AGGTGTTGGC GAGAGGACGC GCGAAGGAAA 500 

CGACCTTTAT CACGAGATGA AAGAAAGTAA CGTTTTGGAT AAAGTCGCCT 550 

TGTGCTACGG ACAGATGAAC GAGCCGCCAG GGGCGAGAAA TCGTATCGCA 600 
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CTGACTGGTC TAACGATGGC TGAGTATTTC 
TGTGCTTATG TTTATCGACA ACATCTTCCG 
AGATGTCGGC ACTCCTCGGA CGTATCCCAT 
ACGCTGGCAA GCGAGATGGG TAAATTTCAA 



PCT/CA00/01150 

CGCGATGAGA TGGGACTTGA 650 

CTTCTCTCAA TCTGGTGCAG 700 

CAGCCGTTGG TTACCAGCCG 750 

GAAAGGATCA CATCGACTAA 800 



2) INFORMATION FOR SEQ ID NO: 1528 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1528 
20 AACTTGAGCG ATTTTCGGAT ACCCTG 2 6 



25 



2) INFORMATION FOR SEQ ID NO: 1529 



(i) SEQUENCE CHARACTERISTICS:. 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1529 

35 

TTGCCGATGA AATAACCGCC GACT 24 



40 2) INFORMATION FOR SEQ ID NO: 1530 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1035 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: M11277 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1530 

ATGCGATTGG TTTGGAAATG TGGGGCGATT CAGGCATCCC GGTTATCTGA 50 

ATGGCTCAAC TCAACAGCCG GTGCTCATGA ACTTGAGCGA TTTTCGGATA 100 

CCCTGACCTT TTCTGTGTAT GGCTCAGTGC TGATCTGGCT GAAATCATAT 150 

CTCCGCGAAT CAGGAAGAAA ACTGCAGTTA GTCGGAATCG CCTTACCCAA 200 

CACCCTGAAC CCAAGGGACG ACCTAGCGCA ATTGGCCGAA ATTATCCAGC 250 

TCATCGATCA CCTCATGAAA CCGCACGTTG ATATGTTGAC TCACTTGTTG 300 

GCGTCCATTG ATGGCCAGTC GGCGGTTATT TCATCGGCAA AATGGGGGGA 350 

GCTAGAAACG GCTCGGCAGG AGAAAGCTAT CTCAGGGGTA ACCAGATTGA 400 

AGCTCCGCTT GGCGTCGCTT GCCCCCGTCC TGAAAAAACA CGTCAACAGC 450 

GATTTGTTCC GAAAAGCCTC TGATCGAATA GAGTCGATAG AGTATACGTT 500 

GGAAACCTTG CGTATAATGA AAACTTTCTT CGATGGTACC TCTCTTGAGG 550 

GAGATACTTC CGTACGTGAC TCGTATATGG CGGGCGTAGT AGATGGAATG 600 

GTTCGAGCGA ATCCGGATGT GAAGATAATT CTGCTGGCGC ACAACAATCA 650 

TCTACAAAAA ACTCCAGTCT CCTTTTCAGG CGAGCTTACG GCTGTTCCCA 700 

TGGGGCAGCA CCTCGCAGAG AGGGTGAATT ACCGTGCGAT TGCATTCACC 750 

CATCTTGGAC CCACCGTGCC GGAAATGCAT TTCCCATCGC CAAAAAGTCC 800 

TCTTGGATTC TCTGTTGTGA CCACGCCTGC CGATGCAATC CGTGAGGATA 850 

GTATGGAACA GTATGTCATC GACGCCTGTG GTACGGAGAA TTCATGTCTG 900 

ACATTGACAG ATGCCCCCAT GGAAGCAAAG CGAATGCGGT CTCAAAGCGC 950 

CTCTGTAGAA ACGAAATTGA GCGAGGCATT TGATGCCATC GTCTGTGTTA 1000 

CAAGCGCCGG CAAGGACAGC CTGGTTGCCC TATAG 1035 



2) INFORMATION FOR SEQ ID NO: 1531 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1531 
TCTTTTTGTT ACGACATACG CTTTT 



2) INFORMATION FOR SEQ ID NO: 1532 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1532 
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AGTGCTTCTT TATCCGCTGT TCTA 24 



2) INFORMATION FOR SEQ ID NO: 1533 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 26 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1533 
CAGCGGATAA AGAAGCACTA CACATT 26 



2) INFORMATION FOR SEQ ID NO: 1534 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1534 
CCTCCTGAAA TAAAGCCCGA CAT 23 

35 



2) INFORMATION FOR SEQ ID NO: 1535 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1260 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
50 (C) ACCESSION NUMBER: A15097 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1535 
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ATGAGGTTCG AAGAATGGGT CAAAGATAAG CATATTCCTT NCAAACNGAA 
TCACCCTGAT GATAATTACG ATGATTTTAA GCCATTAAGA AANATAATTG 
GAGATACCCG AGTTGTAGCA TTAGGTGAAA ATTCTCATTT CATAAAAGAA 
TTCTTTTTGT TACGACATAC GCTTTTGCGT TTTTTTATCG AAGATCTAGG 
5 TTTTACTACG TTTGCTTTTG AATTTGGTTT TGCTGAGGGT CAAATCATCA 
ATAACTGGAT ACATGGACAA GGAACTGACG ATGAAATAGG CAGATTCTTA 
AAACACTTCT ATTATCCAGA AGAGCTCAAA ACCACATTTC TATGGCTAAG 
GGAGTACAAT AAAGCAGCAA AAGAAAAAAT CACATTTCTT GGCATTGATA 
TACCCAGAAA TGGAGGTTCA TACTTACCAA ATATGGAGAT AGTGCATGAC 

10 TTTTTTAGAA CAGCGGATAA AGAAGCACTA CACATTATCG ATGATGCATT 
TAATATTGCA AAAAAGATTG ATTACTTCTC C AC AT C AC AG GCAGCCTTAA 
ATTTACATGA GCTAACAGAT TCTGAGAAAT GCCGTTTAAC TAGCCAATTA 
GCTCGAGTAA AAGTTCGCCT TGAAGCTATG GCTCCAATTC ACATTGAAAA 
ATATGGGATT GATAAATATG AGACAATTCT GCATTATGCC AACGGTATGA 

15 TATACTTGGA CTATAACATT CAAGCTATGT CGGGCTTTAT TTCAGGAGGC 
GGAATGCAGG GCGATATGGG TGCAAAAGAC AAATACATGG CAGATTCTGT 
GCTGTGGCAT TTAAAAAACC CACAAAGTGA GCAGAAAGTG ATAGTAGTAG 
CACATAATGC ACATATTCAA AAAACACCCA TTCTGTATGA TGGATTTCTA 
AGTTGCCTAC CAATGGGCCA AAGACTTAAA AATGCCATTG GTGATGATTA 

20 TATGTCTTTA GGTATTACTT CTTATAGTGG GCATACTGCA GCCCTCTATC 
CGGAAGTTGA TACAAAATAT GGTTTTCGAG TTGATAACTT CCAACTGCAG 
GAACCAAATG AAGGTTCTGT CGAGAAAGCT ATTTCTGGTT GTGGAGTTAC 
TAATTCTTTT GTCTTTTTTA GAAATATTCC TGAAGATTTA CAATCCATCC 
CGAACATGAT TCGATTTGAN TCTATTTACA TGAAAGCAGA ACTCGAGAAA 

25 GCTTTCGATG GAATATTTCA AATTGAAAAG TCATCTGTAT CTGAGGTCGT 
TTATGAATAA 



30 2) INFORMATION FOR SEQ ID NO: 1536 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1260 



35 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) 



MOLECULE TYPE: DNA 



40 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 1536 



AGATGTATTA ACTGGAAAAC AACAA 



45 



2) INFORMATION FOR SEQ ID NO: 1537 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1537 
CTTTGTAATT AGTTTCTGAA AACCA 



2) INFORMATION FOR SEQ ID NO: 1538 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 1538 
TTAGAAGATA TAGGATACAA AATAGAAG 



2) INFORMATION FOR SEQ ID NO: 1539 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) . TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1539 
GAATGAAAAA GAAGTTGAGC TT 



2) INFORMATION FOR SEQ ID NO: 1540 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 
(C) ACCESSION NUMBER: M14039 • 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1540 



ATGAAAAATA ATAATGTAAC AGAAAAAGAA 
5 ATTTGAACAC ATGAAAGTAA CTTATTGGTT 
ATGTATTAAC TGGAAAACAA CAAAGAGAAC 
TTTGACGCTC AACACACTCA AAAAGTTATA 
ATACAAAATA GAAGTTCATT GGATGCCTTC 
AAGAATATGG GTATTTAGAT ATTCATCCTA 
10 TCAATTACCC AAGCAAACCC AGAAGGTGGT 
CTGGTTTTCA GAAACTAATT ACAAAGATCG 
AAGAAGCTCA ACTTCTTTTT CATTCTGGTT 
CATTTTGATA TAAAAAATTT AAAATCAATA 



TTATTTTATA TTTTAGATTT 50 

AGATGGTGGC TGGGGGGTAG 100 

ACAGAGATAT AGATATAGAT 150 

CAAAAATTAG AAGATATAGG 200 

ACGTATGGAA CTTAAGCATG 250 

TAAATCTAAA TGATGATGGA 300 

AATTATGTTT TCCAAAATGA 350 

AAAAATACCA TGTATTTCAA 4 00 

ATGATTTAAC AGAAACAGAC 450 

ACATAA 4 86 



15 



2) INFORMATION FOR SEQ ID NO: 1541 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1541 
■ TGATAATCTT ATACGTGGGG AATTT 25 

30 



20 



2) INFORMATION FOR SEQ ID NO: 1542 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid. 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1542 
45 ATAATTTTCT AATTGCCCTG TTTCAT 26 



2) INFORMATION FOR SEQ ID NO: 1543 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1543 



GGGCAATTAG AAAATTATTT ATCAGA 



2) INFORMATION FOR SEQ ID NO: 1544 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1544 
TTTTACTCAT GTTTAGCCAA TTATCA 



2) INFORMATION FOR SEQ ID NO: 1545 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 804 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
(C) ACCESSION NUMBER: AF110130 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1545 

ATGTTAAAAC AAAAAGAATT AATTGCAAAC GTTAAGAATC TTACTGAGTC 50 

AGATGAACGA ATTACAGCTT GTATGATGTA TGGATCGTTT ACCAAAGGAG 100 

AAGGTGACCA ATACTCTGAT ATAGAGTTCT ATATATTTTT GAAACATAGT 150 

ATAACCTCGA ACTTTGATTC ATCCAACTGG TTGTTTGACG TAGCTCCGTA 200 

CTTGATGCTT TATAAAAATG AGTACGGAAC AGAGGTAGTT ATTTTTGATA 250 

ATCTTATACG TGGGGAATTT CATTTCCTTT CTGAAAAAGA TATGAACATA 300 

ATCCCCTCGT TTAAAGATTC AGGTTATATT CCTGATACGA AGGCTATGCT 350 

TATTTACGAT GAAACAGGGC AATTAGAAAA TTATTTATCA GAGATAAGTG 400 

GTGCAAGACC AAATAGACTT ACTGAAGAAA ATGCTAATTT TTTGTTGTGT 450 

AATTTCTCTA ATCTATGGTT GATGGGAATC AACGTTCTAA AAAGAGGAGA 500 
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20 



25 



ATATGCTCGT TCATTAGAAC TCTTATCACA ACTTCAAAAA AATACACTAC 550 

AACTTATACG TATGGCAGAA AAAAATGCTG ATAATTGGCT AAACATGAGT 600 

AAAAACCTTG AAAAAGAAAT TAGCCTTGAA AATTATAAAA AATTTGCAAA 650 

GACCACTGCT CGATTAGATA AGGTAGAATT ATTTGAAGCC TATAAAAATT 700 

CTTTGCTATT AGTTATGGAT TTGCAAAGTC ACCTTATTGA ACAATACAAC 750 

TTAAAAGTTA CACATGACAT TTTAGAAAGA TTGTTGAATT ACATTAGTGA 800 

AT AG 804 



2) INFORMATION FOR SEQ ID NO: 154 6 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1546 
CAAGAAGGAA TGGCTGTACT AC 22 



2) INFORMATION FOR SEQ ID NO: 1547 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1547 
TAATTCCCAA ATAACCCTAA TAATAGA 27 

40 



2) INFORMATION FOR SEQ ID NO: 1548 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1218 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Streptococcus pyogenes 
(C) ACCESSION NUMBER: U70055 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1548 

5 

ATGGAAAAAT ACAACAATTG GAAACTTAAG TTTTATACAA TATGGGCAGG 50 

GCAAGCAGTA TCATTAATCA CTAGTGCCAT CTTGCAAATG GCGATTATTT 100 

TTTACCTTAC AGAAAAAACT GGATCCGCGA TGGTCTTGTC TATGGCTTCA 150 

CTATTAGGTT TTTTACCCTA TGCGGTCTTT GGACCTGCAA TTGGTGTGCT 200 

10 AGTGGATCGT CATGATAGGA AGAAGATAAT GATTGGTGCT GATTTAATTA 250 

TCGCAGCAGC TGGTTCGGTG CTTACTATTG TTGCATTCTA TATGGAGCTA 300 

CCTGTCTGGA TGGTTATGAT AGTATTGTTT ATCCGTAGCA TTGGAACAGC 350 

TTTTCACACC CCGGCTCTCA ATGCGGTTAC GCCACTTTTA GTACCAGAAG 4 00 

AACAGCTTAC GAAATGTGCA GGCTATAGTC AGTCTTTGCA GTCTATAAGC 4 50 

15 TATATTGTTA GTCCGGCGGT TGCAGCACTC TTATACTCCG TTTGGGAACT 500 

AAATGCTATT ATTGCCATCG ATGTATTGGG TGCTGTGATT GCATCTATTA 550 

CGGTAGCAAT TGTACGTATT CCTAAGCTGG GTGATCGCGT GCAAAGTTTG 600 

GACCCAAATT TCATAAGAGA AATGCAAGAA GGAATGGCTG TACTACGGCA 650 

AAATAAAGGA TTATTTGCTT TATTACTCGT TGGAACATTA TATATGTTTG 700 

20 TTTATATGCC AATTAATGCA CTATTCCCTT TAATTAGCAT GGATTACTTT 750 

AATGGAACAC CTGTGCATAT TTCTATTACG GAAATTTCCT TTGCATCTGG 800 

AATGTTGATA GGGGGTCTAT TATTAGGGTT ATTTGGGAAT TACCAAAAGC 850 

GAATCTTATT AATAACGGCA TCCATTTTTA TGATGGGGAT AAGCTTAACC 900 

ATTTCAGGAT TACTTCCCCA AAGTGGATTT TTCATTTTTG TAGTCTGCTG 950 

25 TGCAATAATG GGGCTTTCTG TTCCGTTTTA CAGCGGTGTG CAAACAGCTC 1000 

TTTTTCAGGA GAAAATTAAG CCTGAATATT TAGGACGTGT ATTTTCTTTA 1050 

ACTGGAAGTA TCATGTCTCT TGCTATGCCA ATTGGATTAA TTCTTTCTGC 1100 

ACTCTTTGCT GATAGAATCG GTGTAAATCA TTGGTTTTTA CTATCAGGTA 1150 

CTTTAATTAT TTGCATTGCA ATAGTTTGCC CAATGATAAA TGAGATTAGA 1200 

30 AAATTAGATT TAAAATAA 1218 



35 



2) INFORMATION FOR SEQ ID NO: 1549 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1549 

45 

GCTTATTATT AGGAAGATTA GGGGGC 26 



50 2) INFORMATION FOR SEQ ID NO: 1550 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1550 
TAGCAAGTGA CATGATACTT CCGA 24 



2) INFORMATION FOR SEQ ID NO: 1551 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1218 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
25 (C) ACCESSION NUMBER: U83667 

• (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1551 

ATGGAAAAAT ACAACAATTG GAAACGAAAA TTTTATGCAA TATGGGCAGG 50 

30 GCAAGCAGTA TCATTAATCA CTAGTGCCAT CCTGCAAATG GCGATTATTT 100 

TTTACCTTAC AGAAAAAACA GGATCTGCGA TGGTCTTGTC TATGGCTTCA 150 

TTAGTAGGTT TTTTACCCTA TGCGATTTTG GGACCTGCCA TTGGTGTGCT 200 

AGTGGATCGT CATGATAGGA AGAAGATAAT GATTGGTGCC GATTTAATTA 250 

TCGCAGCAGC TGGTGCAGTG CTTGCTATTG TTGCATTCTG TATGGAGCTA 300 

35 CCTGTCTGGA TGATTATGAT AGTATTGTTT ATCCGTAGCA TTGGAACAGC 350 

TTTTCATACC CCAGCACTCA ATGCGGTTAC ACCACTTTTA GTACCAGAAG 400 

AACAGCTAAC GAAATGCGCA GGCTATAGTC AGTCTTTGCA GTCTATAAGC 4 50 

TATATTGTTA GTCCGGCAGT TGCAGCACTC TTATACTCCG TTTGGGATTT 500 

AAATGCTATT ATTGCCATCG ACGTATTGGG TGCTGTGATT GCATCTATTA 550 

40 CGGTAGCAAT TGTACGTATA CCTAAGCTGG GTAATCAAGT GCAAAGTTTA 600 

GAACCAAATT TCATAAGGGA GATGAAAGAA GGAGTTGTGG TTCTGAGACA 650 

AAACAAAGGA TTGTTTGCCT TATTACTCTT AGGAACACTA TATACTTTTG 700 

TTTATATGCC AATCAATGCA CTATTTCCTT TAATAAGCAT GGAACACTTT 750 

AATGGAACGC CTGTGCATAT TTCTATTACG GAAATTTCCT TTGCATTTGG 800 

45 GATGCTAGCA GGAGGCTTAT TATTAGGAAG ATTAGGGGGC TTCGAAAAGC 850 

ATGTATTACT AATAACAAGT TCATTTTTTA TAATGGGGAC CAGTTTAGCC 900 

GTTTCGGGAA TACTTCCTCC AAATGGATTT GTAATATTCG TAGTTTGCTG 950 

TGCAATAATG GGGCTTTCGG TGCCATTTTA TAGCGGTGTG CAAACAGCTC 1000 

TTTTTCAGGA GAAAATTAAG CCTGAATATT TAGGACGTGT ATTTTCTTTG 1050 

50 ATCGGAAGTA TCATGTCACT TGCTATGCCA ATTGGGTTAA TTCTTTCTGG 1100 

ATTCTTTGCT GATAAAATCG GTGTAAATCA TTGGTTTTTA CTATCAGGTA 1150 

TTTTAATTAT TGGCATTGCT ATAGTTTGCC AAATGATAAC TGAGGTTAGA 1200 

AAATTAGATT TAAAATAA 1218 
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15 



2) INFORMATION FOR SEQ ID NO: 1552 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1552 
GGCAAGCAGT ATCATTAATC ACTA 24 



2) INFORMATION FOR SEQ ID NO: 1553 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE .TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1553 
CAATGCTACG GATAAACAAT ACTATC > 26 

30 



2) INFORMATION FOR SEQ ID NO: 1554 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1554 
45 AGAAAATTAA GCCTGAATAT TTAGGAC 27 



2) INFORMATION FOR SEQ ID NO: 1555 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

802 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1555 
TAGTAAAAAC CAATGATTTA CACCG 



2) INFORMATION FOR SEQ ID NO: 1556 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1556 
ACTGTACGCA CTTGCAGCCC GACAT 



2) INFORMATION FOR SEQ ID NO: 1557 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1557 
GAACGGCAGG CGATTCTTGA GCAT 



2) INFORMATION FOR SEQ ID NO: 1558 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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2) INFORMATION FOR SEQ ID NO: 1559 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1559 
GCCGCAGCGA GGTACTCTTC GTTA 



2) INFORMATION FOR SEQ ID NO: 1560 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 906 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: D16251 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1560 

ATGACCGTAG TCACGACCGC CGATACCTCC CAACTGTACG CACTTGCAGC 50 

CCGACATGGG CTCAAGCTCC ATGGCCCGCT GACTGTCAAT GAGCTTGGGC 100 

TCGACTATAG GATCGTGATC GCCACCGTCG ACGATGGACG TCGGTGGGTG 150 

CTGCGCATCC CGCGCCGAGC CGAGGTAAGC GCGAAGGTCG AACCAGAGGC 200 

GCGGGTGCTG GCAATGCTCA AGAATCGCCT GCCGTTCGCG GTGCCGGACT 250 

GGCGCGTGGC CAACGCCGAG CTCGTTGCCT ATCCCATGCT CGAAGACTCG 300 

ACTGCGATGG TCATCCAGCC TGGTTCGTCC ACGCCCGACT GGGTCGTGCC 350 

GCAGGACTCG GAGGTCTTCG CGGAGAGCTT CGCGACCGCG CTCGCCGCCC 400 

TGCATGCCGT CCCCATTTCC GCCGCCGTGG ATGCGGGGAT GCTCATCCGT 4 50 

ACACCGACGC AGGCCCGTCA GAAGGTGGCC GACGACGTTG ACCGCGTCCG 500 

ACGCGAGTTC GTGGTGAACG ACAAGCGCCT CCACCGGTGG CAGCGCTGGC 550 

TCGACGACGA TTCGTCGTGG CCAGATTTCT CCGTGGTGGT GCATGGCGAT 600 

CTCTACGTGG GCCATGTGCT CATCGACAAC ACGGAGCGCG TCAGCGGGAT 650 

GATCGACTGG AGCGAGGCCC GCGTTGATGA CCCTGCCATC GACATGGCCG 7 00 

CGCACCTTAT GGTCTTTGGT GAAGAGGGGC TCGCGAAGCT CCTCCTCACG 7 50 
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TATGAAGCGG CCGGTGGCCG GGTGTGGCCG 
GGAGCGCCTT GCGTTCGGGG CGGTCACCTA 
CGGGTAACGA AGAGTACCTC GCTGCGGCGA 
GAATGA 



CGGCTCGCCC ACCACATCGC 800 
CGCACTCTTC GCCCTCGACT 850 
AGGCGCAGCT CGCCGCAGCG 900 

906 



2) INFORMATION FOR SEQ ID NO: 1561 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1048 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: ATCC 18804 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1561 

GATCATGGTA AAACTACATT GACTGCTGCT ATCACCAAAG TTTTAGCCGA 50 

ACAAGGTGGT GCCAACTTCT TGGATTAYGG TTCTATTGAT AGAGCTCCAG 100 

AAGAAAGAGC TAGAGGTATC ACTATTTCCA CTGCCCACGT TGAATACGAA 150 

AC C AAGAAC A GACACTATGC CCACGTTGAT TGTCCAGGAC ACGCTGATTA 200 

TATCAAAAAT ATGATTACTG GTGCCGCTCA AATGGATGGT GCTATCATTG 250 

TTGTTGCTGC CACTGATGGT CAAATGCCTC AAACCAGAGA ACATTTGTTA 300 

TTGGCCAGAC AAGTTGGTGT TCAAGACTTG GTTGTGTTTG TCAACAAAGT 350 

CGATACTATT GATGACCCTG AAATGTTGGA ATTAGTCGAA ATGGAAATGA 400 

GAGAATTGTT ATCCACCTAC GGTTTTGATG GTGACAACAC TCCAGTTATT 450 

ATGGGATCTG CTTTAATGGC TTTGGAAGAC AAGAAACCAG AAATTGGTAA 500 

GGAAGCTATC TTGAAATTGT TAGATGCTGT CGATGAACAC ATTCCAACTC 550 

CATCAAGAGA CTTGGAACAA CCATTTTTGT TACCAGTTGA AGACGTGTTC 600 

TCCATCTCCG GTAGAGGAAC TGTTGTCACT GGTAGAGTTG AAAGAGGTGT 650 

TTTGAAGAAG GGTGAAGAAA TCGAAATTGT TGGTGGTTTT GACAAACCTT 700 

ACAAGACTAC TGTTACCGGT ATTGAAATGT TCAAAAAAGA ATTAGACTCT 750 

GCTATGGCTG GTGACAACTG TGGTGTTTTG TTAAGAGGTG TTAAAAGAGA 800 

TGAAATCAAG AGAGGTATGG TTTTGGCCAA ACCAGGTACT GCCACTTCTC 850 

ACAAGAAGTT CTTGGCTTCC TTGTATATTT TGACTTCCGA AGAAGGTGGY 900 

CGTTCCACTC CATTTGGTGA AGGTTACAAG CCTCAATGCT TCTTCAGAAC 950 

TAACGATGTC ACTACCACAT TTTCATTCCC AGAAGGAGAA GGTGTTGACC 1000 

ATTCTCAAAT GATCATGCCA GGTGACAACA TTGAAATGGT TGGTGAAT 1048 



2) INFORMATION FOR SEQ ID NO: 1562 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1074 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida dubliniensis 

(B) STRAIN: NCPF 3949 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1562 

TGATCACGGT AAAACCACAT TAACTGCTGC CATTACCAAA GTATTAGCTG 50 

AACAAGGTGG TGCCAACTTT TTGGATTACG GTTCCATTGA TAGAGCTCCA 100 

GAAGAAAGAG CCAGAGGTAT CACTATTTCC ACTGCCCACG TTGAATACGA 150 

AACCAAGAAC AGACACTATG CCCACGTTGA TTGTCCAGGA CACGCTGATT 200 

ATATCAAAAA CATGATTACT GGTGCTGCTC AAATGGATGG TGCTATCATT 250 

GTTGTTGCTG CTACTGACGG TCAAATGCCA CAAACCAGAG AACATTTATT 300 

GTTGGCAAGA CAAGTTGGTG TTCAAGACTT GGTTGTCTTT GTCAACAAAG 350 

TTGATACTAT TGATGACCCT GAGATGTTGG AATTAGTCGA AATGGAAATG 400 

AGAGAATTGT TGTCCACCTA CGGTTTTGAT GGTGACAACA CTCCTGTTAT 450 

TATGGGATCT GCTTTAATGG CCTTGGAAGG CAAAAAACCA GAAATTGGTA 500 

AGGAAGCTAT TTTGAGATTG TTAGATGCTG TCGATGAACA CATTCCAACT 550 

CCATCAAGAG ACTTGGAACA ACCATTTTTG TTGCCAGTTG AAGACGTGTT 600 

CTCCATCTCT GGTAGAGGAA CTGTTGTCAC CGGTAGAGTT GAAAGAGGTG 650 

TCTTGAAGAA GGGTGAAGAA ATCGAAATTG TTGGTGGTTT TGACAAACCA 700 

TACAAGACCA CTGTTACTGG TATTGAAATG TTCAAAAAGG AATTAGATTC 750 

TGCTATAGCT GGTGACAACT GTGGTGTTTT GTTGAGAGGT GTTAAAAGAG 800 

ATGAAATCAA GAGAGGTATG GTTTTGGCCA AGCCAGGTAC TGCTACTTCT 850 

CACAAGAAAT TTTTAGCATC TTTGTATATT TTGACTTCAG AAGAAGGTGG 900 

TCGTTCCACT CCATTTGGAG AAGGTTACAA GCCTCAATGT TTCTTCAGAA 950 

CTAATGACGT CACTACCACA TTTTCATTCC CAGAAGGAGA AGGTGTTGAC 1000 

CACTCCCAAA TGGTCATGCC AGGTGATAAC ATTGAAATGG TTGGTGAATT 1050 

GATCAAATCA TGTCCATTGG AAGT 1074 



2) INFORMATION FOR SEQ ID NO: 1563 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1033 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida famata 

(B) STRAIN: ATCC 62894 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1563 

GATCACGGGA AGACTACTTT GACCGCTGCC ATCACCAAAG TTTTAGCCGA 50 
AAAAGGTGGT GCTAACTTCT TGGACTACGG TTCTATCGAT AAAGCTCCAG 100 
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AAGAAAGAGC CAGAGGTATT ACTATTTCTG CTGCCCATGT TGAATACGAA 150 

ACTGACAAGA GACACTATGC CCATGTTGAT TGTCCAGGTC ACGCAGATTA 2 00 

TATCAAGAAT ATGATTACTG GTGCTGCTCA AATGGATGGT GCCATTATTG 250 

TTGTTGCTGC TTCCGATGGT CAAATGCCTC AAACCAGAGA ACATTTGTTA 300 

5 TTGGCCAGAC AAGTTGGTGT TCAACACTTG GTTGTTTTCG TCAACAAGGT 350 

CGACACCATT GACGATCCAG AAATGTTGGA ATTGGTTGAA ATGGAAATGA 400 

GAGATTTGTT AACTACTTAC GGTTTTGATG GTGATAACAC CCCAGTTATC 450 

ATGGGATCTG CTTTGTGTGC TTTGGAATCC AGAGAACCAG AAATTGGTCA 500 

AAAAGCCATT GAAAAATTGT TAGATGCCGT CGATGAATAC ATTCCAACCC 550 

10 CAGTCAGAGA CTTGGAACAA CCATTCTTGA TGCCAGTTGA AGAAGTTTTC 600 

TCCATTTCCG GTAGAGGTAC CGTTGTTGCT GGTAGAGTCG AAAGAGGTAC 650 

CTTGAAGAAG GGTGAAGAAA TCGAAATTGT TGGTGGTTTC GACAAGCCAT 700 

TCAAGGCCAC TGTTACTGGT ATTGAAATGT TCAAGAAGGA ATTGGACTCC 750 

GCTCTTGCTG GTGACAACTG TGGTATCTTG TTGAGAGGTG TCAAGAGAGA 800 

15 CGAAGTTAAG AGAGGTATGG TCTTGACCAA GCCAAACACC GTCACTTCCC 850 

ACAAGAAGAT CTTGGCCTCG TTGTATATCT TGACCAAGGA AGAAGGTGGT 900 

AGACACTCTC CATTTGGAGC CAACTACAAG CCCCAATTGT TCATGAGAAC 950 

CACCGATGTT ACCGGTACCA TGACCTTCCC AGAAGGTGCC GACCAATCTG 1000 

CCATGGTCAT GCCAGGTGAC AACGTTGAAA TGC 1033 

20 



2) INFORMATION FOR SEQ ID NO: 1564 



25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1056 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida glabrata 
35 (B) STRAIN: ATCC 66032 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1564 



GATCACGGTA AGACTACATT GACAGCTGCT ATCACCAAGA CATTGGCCAA 50 

40 GAACGGTGGT GCTGATTTCT TGGACTACTC TTCCATTGAC AAAGCTCCAG 100 

AGGAGAGAGC CCGTGGTATC ACTATCTCTA CTGCCCATGT CGAGTACGAG 150 

ACCGCCAAGA GACATTACTC CCACGTCGAC TGTCCAGGTC ACGCCGACTA 2 00 

CATCAAGAAC ATGATTACTG GTGCTGCCCA AATGGACGGT GCTATCATCG 250 

TTGTCGCCGC CACCGATGGT CAAATGCCAC AAACTAGAGA GCATTTGCTG 3 00 

45 TTGGCCAGAC AAGTCGGTGT TCAACGTATC GTTGTCTTTG TCAACAAGGT 350 

GGACACCATC GATGACCCTG AAATGTTGGA ATTAGTGGAA ATGGAAATGA 4 00 

GAGAATTGTT GAACGAATAC GGTTTTGACG GTGACAATGC CCCTATCATT 450 

ATGGGTTCCG CTTTGTGTGC CCTAGAAGGT CGTCAACCTG AAATTGGTGA 500 

GCAAGCTATC ATGAAACTAT TGGACGCTGT TGATGAATAC ATTCCAACCC 550 

50 CAGAAAGAGA CTTGAACAAG CCATTCTTGA TGCCTGTTGA AGACATCTTC 600 

TCCATCTCTG GTAGAGGTAC CGTCGTCACT GGACGTGTCG AAAGAGGTAA 650 

CTTGAAGAAG GGTGAAGAAG TTGAAATTGT TGGTCACAAC ACTACCCCAT 700 

TGAAGACCAC CGTTACTGGT ATCGAAATGT TCAGAAAGGA ATTGGACCAA 750 
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GCTATGGCTG GTGACAACGC 
CCAATTGAAG AGAGGTATGG 
ACACCAAGAT TTTGGCTTCT 
AGACATTCTG GTTTCGGTGA 
CGCAGATGTC ACTGTTGTGA 



CGGTATCCTA TTGAGAGGTA TCAGAAGAGA 
TCATGGCCAA GCCAGGTACC GTCAAGGCTC 
TTGTACATCT TGTCTAAGGA AGAAGGTGGT 
AAACTACAGA CCTCAGATGT TTATCAGAAC 
TGAAGTTCCC AGAATCTGTG GAAGACCACT 



800 
850 
900 
950 
1000 



10 



15 



20 



25 



30 



35 



40 



45 



CTATGCAAGT TATGCCAGGT GACAACGTCG AAATGGTCTG TGAACTAGTC 1050 



2) INFORMATION FOR SEQ ID NO: 1565 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1061 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida guilliermondii 

(B) STRAIN: ATCC 6260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1565 

GATCATGGTA AGACCACTTT GACCGCTGCT ATCACCAAGG TTTTGTCGGA 50 

AAAAGGAGGT GCTAATTTCT TGGATTACGG CTCCATCGAC AGAGCTCCAG 100 

AAGAGAGAGC CAGAGGTATC ACCATTTCCA CTGCCCATGT TGAGTACCAA 150 

ACTGATAAGA GACATTATGC CCACGTTGAC TGTCCAGGTC ACGCCGATTA 200 

CATTAAGAAT ATGATTACTG GTGCCGCCCA GATGGACGGT GCCATTATTG 250 

TTGTTGCTGC CACTGACGGT CAAATGCCTC AGACCAGAGA GCACTTGTTG 300 

TTGGCCAGAC AAGTTGGTGT GCAACACTTG GTAGTTTTTG TGAACAAGGT 350 

GGACACCATT GACGATCCCG AGATGTTGGA ATTGGTCGAG ATGGAAATGA 400 

GAGAATTGTT GAGTCAGTAC GGTTTCGATG GTGACAACAC CCCAGTTATC 450 

ATGGGATCTG CTTTGTGTGC TTTGGAAAGT AAGCAGCCAG AAATTGGTGT 500 

GCAAGCCATT GAAAAATTGT TGGACGCTGT CGATGAGCAC ATTCCTACTC 550 

CTACCCGTGA CTTGGAACAG CCATTCTTGT TGCCTGTTGA AGATGTGTTC 600 

TCCATTTCTG GTAGAGGAAC TGTGGTTACT GGTAGAGTCG AAAGAGGTTC 650 

GTTGAAGAAG GGTGAGGAAA TCGAGATTGT TGGTGACTTT GACAAGCCAT 700 

TCAAGACCAC TGTGACTGGA ATTGAAATGT TCAAGAAGGA ATTGGATGCT 750 

GCTATGGCTG GTGACAATGC TGGTATCTTG TTGAGAGGTG TCAAGAGAGA 800 

CGATGTCAAG AGAGGTATGG TTTTGGCCAA GCCTTCCACC GTCACTTCTC 850 

ACAAGAAGGT GTTGGCTTCC TTGTACATCT TGAGTAAGGA AGAAGGTGGC 900 

CGTCACTCTC CTTTTGGTGA GAACTACAAG CCTCAATTGT TCATCAGAAC 950 

TACTGACGTT ACCGGTACTT TAAGATTCCC AGCCGGCGAG GGTGTCGACC 1000 

ACTCGCAAAT GGTTATGCCA GGTGACAATG TTGAGATGGA AATTGAGCTT 1050 

GTGAGAAAGA C 1061 



2) INFORMATION FOR SEQ ID NO: 1566 



CACCCA 



1056 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1073 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Candida haemulonii 

(B) STRAIN: ATCC 22991 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1566 

15 GATCACGGTA AGACTACCTT GACTGCTGCT ATCACCAAGG TTTTGGCTTC 50 

TAAGGGTGGT GCTAGCTTCT TGGACTATGG TTCCATTGAC AGAGCCCCAG 100 

AGGAGAGAGC TAGAGGTATT ACTATTTCTA CTGCCCACGT TGAGTACCAA 150 

ACCGAAAAGA GACACTACGC CCACGTCGAC TGTCCAGGTC ACGCTGATTA 200 

CATTAAGAAT ATGATTACTG GTGCCGCCCA GATGGACGGT GCTATCATTG 250 

20 TTGTTGCTGC TTCTGATGGT CAGATGCCTC AGACCAGAGA GCACCTTTTG 300 

TTGGCCAGAC AGGTTGGTGT TCAGAACTTG GTTGTTTTCG TTAACAAGGT 350 

TGACACCATT GACGACCCTG AAATGTTGGA ATTGGTTGAG ATGGAAATGA 400 

GAGAATTGTT GACTACTTAC GGTTTTGACG GTGATGAGAC TCCTGTTATC 450 

ATGGGTTCTG CTTTGTGCGC TTTGGAAGAG AAGCAACCAG AGATTGGTGA 500 

25 GCAGGCTATC ATGAAGTTGT TGGACGCTGT CGATGAGTAC ATTCCAACCC 550 

CACAGCGTGA CTTGGAGCAG CCATTCTTGA TGCCTGTTGA GGATGTTTTC 600 

TCCATTTCTG GTAGAGGTAC TGTCGTTACT GGTAGAGTTG AGAGAGGTTC 650 

TTTGAAGAAG GGTGAGGAAA TCGAGATTGT CGGTGACTTC GCCAAGACTT 700 

TCAAGGCTAC CGTTACTGGT ATTGAGATGT TCAAGAAGGA ATTGGATGCT 750 

30 GCTATGGCTG GTGACAACGC CGGTATCTTG TTGAGAGGTG TCAAGAGAGA 800 

TGAGATCTCC CGTGGTGATG TCTTGGCCAA GCCAGGTACT GTTACTCCAC 850 

ACAAGAAGAT CTTGGCTTCT TTGTACGTTT TGACCAAGGA AGAAGGTGGT 900 

CGTCACAACC CATTCGCTGA GAACTACAAG CCACAGTTGT TCCTCAGAAC 950 

CACCAACGTC ACTGGTACCA TGAGATTCCC AGAAGGTGAA GATGTTGACC 1000 

35 ACTCTGCCAT GGTTAACCCA GGTGACAACG TTGAGATGGA AATCGAGTTG 1050 

GGTAGAAAGG CCCCACTTGA GTT 1073 



40 2) INFORMATION FOR SEQ ID NO: 1567 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida kefyr 

(B) STRAIN: ATCC 28838 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1567 

CATGGTAAGA CCACTTTGAC TGCTGCCATC ACCAAGACTC TAGCTGAACG 50 

TGGTGGTGCT GACTTTTTGG ACTACTCTTC TATTGACAAG GCTCCAGAAG 100 

5 AAAGAGCYAG AGGTATCACT ATTTCTACTG CTCATGTTGA ATACGAGACT 150 

GAAAAGAGAC ATTACTCCCA CGTTGACTGT CCAGGTCACG CTGATTACAT 200 

CAAGAACATG ATTACTGGTG CTGCTCAAAT GGACGGTGCT ATTATTGTTG 250 

TTGCTGCTAC TGATGGTCAA ATGCCTCAAA CCAGAGAGCA TTTGTTGTTG 300 

GCCAGACAAG TTGGTGTCCA ACACATTGTT GTTTTCGTTA ACAAGGTTGA 350 

10 CACCATCGAT GATCCAGAAA TGTTGGAATT GGTTGAAATG GAAATGAGAG 4 00 

AATTRTTGAC TCAATATGGC TTTGACGGTG ACAACACTCC AGTGATCATG 450 

GGTTCTGCTT TGTGTGCCTT GGAAGGTAAG CAACCAGAAA TTGGTGAGCA 500 

AGCCATCATG AAGTTGTTGG ACGCTGTTGA CGAATACATC CCAACCCCAG 550 

CCCGTGACTT GGAAAAVCCA TTCTTGATGC CTGTTGAAGA TATCTTCTCC 600 

15 ATTTCCGGTA GAGGTACTGT CGTCACTGGT AGAGTTGAAC GTGGTAACTT 650 

GAAGAAGGGT GAAGAAATCG AAATTGTTGG TCACAACACC ACTCCTTTCA 700 

AGACTACTGT TACTGGTATT GAAATGTTCA GAAAGGAATT GGACCAAGCC 750 

ATGGCTGGTG ACAACGCTGG TGTCCTTTTG AGAGGTGTCA GAAGAGACCA 800 

ATTGAAGAGA GGTATGGTTT TGGCTAAGCC AGGTACTGTT AAGGCCCACA 850 

20 CCAAGTTCTT GGCTTCCTTG TACATTTTGA CCAAGGAAGA AGGTGGTAGA 900 

CACTCCGGTT TCGGTGAAAA CTACAGACCA CAAATCTACG TCAGAACTGC 950 

TGACGTTACC GTDGTCTTGA AGTTCCCAGA ATCTGTTGAA GACCATTCCA 1000 

TGCAAGTCAT GCCAGGTGAC AATGTCGAAA TGGAGTGTGA ATTGGTTCAC 1050 

CCAACTCCAT TG 1062 

25 



2) INFORMATION FOR SEQ ID NO: 1568 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1062 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM : Candida lusitaniae 
40 (B) STRAIN: ATCC 66035 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1568 



CGGAAAGACC ACCTTGACCG CCGCCATTAC 
45 GTGGCGCCAA CTTCTTGGAC TACGGTGCCA 
AGAGCACGTG GTATCACCAT TTCCACTGCC 
CAACAGACAC TACGCCCACG TTGACTGTCC 
AGAACATGAT CACGGGTGCC GCTCAAATGG 
GCAGCCACCG ACGGCCAAAT GCCTCAAACC 
50 CAGACAAGTT GGTGTGCAAC ACTTGGTTGT 
CCATCGACGA CCCAGAAATG TTGGAATTGG 
TTGTTGACTC AATACGGATT TGACGGCGAT 
CTCTGCTCTT TGCGCTTTGG AAGGTAGAGA 

81 



CAAGGTTTTG GCTGACAAGG 50 

TTGACAAGGC TCCTGAAGAA 100 

CACGTTGAAT ACGAGACCGA 150 

AGGTCACGCC GATTACATCA 200 

ACGGTGCCAT TATTGTTGTT 250 

AGAGAGCACT TGTTGTTGGC 300 

TTTCGTGAAC AAGGTTGATA 350 

TTGAAATGGA AATGAGAGAA 400 

GAAACCCCTG TTGTTATGGG 450 

ACCAGAGATT GGTGAGCAAG 500 
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AGAAGGGTGA 
ACTACTGTTA 
GGCTGGTGAC 
TTTCCCGTGG 
AAGGTTTTGG 
CTCTCCATTT 
ATGTCACTGG 
GCTATGGTTT 
AAAGACTCCT 



CCATCACCAA 
CGTGACTTGG 
TTCTGGTAGA 



GTTGTTGGAG 
AACAACCATT 
GGTACTGTTG 
GGAGATCGAG 
CTGGTATTGA 
AATGCTGGTA 
TATGGTTTTG 
CTTCTTTGTA 
GGCGAGAACT 
TACTTTGAGA 
CTCCAGGTGA 
CT 



GCTGTTGACG AGTACATCCC AACCCCACAA 
CTTGATGCCT GTTGAAGATG TTTTCTCCAT 
TCACTGGTAG AGTGGAGAGA GGTTCCTTGA 
ATTGTTGGTG ACTTTGACAA GCCTTTCAAG 
GATGTTCAAG AAGGAATTGG ACGCTGCTAT 
TCTTGTTGAG AGGTGTCAAG AGAGAACAAG 
GCCAAGCCAG GCACTGTGAC CTCGCACAAG 
CATTTTGTCT AAGGAAGAAG GTGGTCGTCA 
ACAAGCCTCA ATTGTTCCTT AGAACTACCG 
TTCCCAGCAG GTGAGGACGT TGACCACTCC 
CAATGTCGAG ATGGAAATCG AGTTGGTCAG 



550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1062 



10 



15 



20 



25 



30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1569 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 990 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM:. Candida sphaerica 

(B) STRAIN: ATCC 2504 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1569 

TCACCAAGAC TTTGGCTGAA CGTGGTGGTG CTGATTTCTT GGACTACTCT 50 

TCCATTGACA AGGCTCCAGA AGAAAGAGCR AGAGGTATCA CTATTTCTAC 100 

TGCACATGTT GAATATGAAA CTGACAAGAG ACATTACTCT CACGTCGACT 150 

GTCCAGGTCA TGCTGATTAC ATCAAGAATA TGATTACTGG TGCTGCCCAA 200 

ATGGATGGTG CTATCATTGT TGTTGCTGCT ACAGATGGTC AAATGCCTCA 250 

AACCAGAGAA CATTTGTTGT TGGC TAGACA AGTTGGTGTT CAACAYATCG 300 

TTGTTTTCGT TAACAAGGTT GACACTATCG ATGACCCTGA AATGTTAGAA 350 

TTGGTTGAAA TGGAAATGAG AGAATTATTG ACCCAATACG GTTTCGATGG 400 

TGACAACACT CCAGTCATCA TGGGTTCTGC TTTGTGTGCT TTAGAAGGTA 450 

AGCAACCAGA AATTGGTGAA CAAGCAATCA TGAAGTTATT GGACGCTGTT 500 

GACGAATACA TCCCAACTCC AGCTCGTGAT TTGGAAAAGC CTTTCTTGAT 550 

GCCTGTTGAA GATATCTTCT CCATCTCCGG TAGAGGTACC GTCGTAACTG 600 

GTAGAGTTGA ACGTGGTAAC TTGAAKAAGG GTRAAGAAAT CGAAATCGTT 650 

GGTCACAACA CCACTCCATT CAAGACCACT GTTACTGGTA TTGAAATGTT 700 

CAGAAAGGAA TTGGACCAAG CTATGGCTGG TGATAACGCT GGTGTCWTGT 750 

TGAGAGGTGT CAGAAGAGAC CAATTAAAGA GAGGTATGGT CTTGGCCAAG 800 

CCAGGTACTG TCAAGGCTCA CACCRAATTC TTGGCCTCTT TGTATATCTT 850 

GACCAAGGAA GAAGGTGGTA GACATTCCGG TTTCGGTGAA AATTACAGAC 900 

CTCAAATCTA CGTTAGAACT GCTGATGTCA CCGTTGTTTT GAAGTTCCCA 950 

GAAGCTGTTG AAGATCACTC TATGCAAGTC ATGCCAGGTG 990 
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2) INFORMATION FOR SEQ ID NO: 1570 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1184 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida tropicalis 

(B) STRAIN: ATCC 750 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1570 

GATCATGGTA AAACCACTTT GACTGCTGCC ATTACTAAAG TCTTGGCTGA 50 

TAAAGGTCAA GCTAACTTCT TAGATTACGG TTCTATTGAT AGAGCTCCAG 100 

AAGAAAGAGC CAGAGGTATC ACCATCTCTA CTGCCCACGT TGAATATGAA 150 

2 0 ACCGAAAAAA GACATTACGC CCATGTTGAG TATGTATACT TTTTTTTGTT 200 

GTGTAATTGT TTTAAAGATT TTCTTTAAAG CTGAAGAAGT CAAATCAGTT 250 

CTTTGATAAC TTCTATTAAA AAAAGGGAAA AATTAACAAG ATATACTAAC 300 

ACTATAACAG TTGTCCTGGA CATCAAGATT ATATCAAGAA TATGATTACC 350 

GGTGCCGCTC AAATGGATGG TGCTATTATT GTTGTTGCTG CCACTGATGG 400 

25 TCAAATGCCA CAAACCAGAG AACATTTGTT GTTGGCTAGA CAAGTCGGTG 450 

TTCAAGATTT GGTTGTCTTT GTTAACAAAG TCGACACTAT TGATGACCCA 500 

GAAATGTTGG AATTGGTTGA AATGGAAATG AGAGAATTAT TGACTACTTA 550 

CGGTTTTGAT GGTGATAACA CTCCTGTTAT CATGGGTTCT GCTTTGATGG 600 

CCTTGCAAGG TAAACAACCA GAAATTGGTG AACAAGCTAT CATGAAATTG 650 

30 ATGGACGCTA TTGATGAACA CATTCCAACC CCAACCAGAG ACTTGGAACA 700 

ATCTTTCTTG ATGCCAGTTG AAGATGTTTT CTCCATTTCT GGTAGAGGTA 750 

CTGTTGTTAC TGGTAGAGTC GAAAGAGGTG TCTTAAAGAA GGGTGAAGAA 800 

ATTGAAATTG TTGGTGGTTT CGAAAAACCA TTCAAGACCA CTGTTACTGG 850 

TATTGAAATG TTCAAGAAAG AATTAGATGC TGCTATGGCT GGTGACAACT 900 

35 GTGGTGTCTT GTTGAGAGGT GTCAAGAGAG ACGAAATCAA GAGAGGTATG 950 

GTTTTGGCTA AACCAGGTAC TGCTACTTCC CACAAGAAAT TCTTGGCTTC 1000 

CATGTATATC TTAACTGCTG AAGAAGGTGG TCGTTCCACT CCATTCGGTG 1050 

AAGGTTACAA GCCACAATGT TTCTTCAGAA CTAACGATGT TACCACTTCC il00 

TTCTCTTTCC CAGAAGGTGA AGGTGTTGAC CACTCCCAAA TGGTTATGCC 1150 

40 AGGTGACAAC ATTGAAATGG TCGGTGAATT GATT 1184 



45 



2) INFORMATION FOR SEQ ID NO: 1571 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1071 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



812 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida viswanathii 

(B) STRAIN: ATCC 28269 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1571 

CGATCACGGT AAGACCACCT TGACCGCCGC CATCACCAAG GTCTTGGCCG 50 

ACAAGGGTCA GGCTAACTTC TTGGACTACG GATCCATTGA CAGAGCCCCT 100 

GAAGAAAGAG CAAGAGGTAT CACTATCTCC ACTGCCCACG TTGAATACGA 150 

GACTGATAAG AGACACTATG CCCACGTTGA TTGCCCGGGC CATCAAGATT 200 

ATATCAAGAA TATGATCACT GGTGCTGCCC AAATGGACGG TGCTATCATT 250 

GTTGTTGCTG CTACTGACGG TCAGATGCCA CAAACCAGAG AACACTTGTT 300 

GTTGGCTAGA CAAGTTGGTG TCCAAGACTT GGTTGTTTTC GTTAACAAGG 350 

TTGACACTAT CGATGACCCA GAAATGTTGG AATTGGTTGA AATGGAAATG 400 

AGAGAATTAT TATCTTCTTA YGGCTTTGAC GGTGACAACA CCCCAGTTGT 450 

CATGGGTTCC GCTTTGATGG CTTTGCAAGG TAAGCAACCA GAAATTGGTG 500 

AACAAGCTAT TATTAAGTTG ATGGACGCTA TTGATGAACA CATYCCAACC 550 

CCAACCAGAG ACTTGGAACA ACCATTCTTG TTGCCAGTTG AAGATGTCTT 600 

TTCTATTTCC GGTAGAGGTA CCGTCGTCAC TGGTAGAGTC GAAAGAGGTG 650 

TCTTGAAGAA GGGTGAAGAA ATTGAAATTG TCGGTAACTT TGAAAAGCCA 700 

TTCAAGACCA CCGTTACTGG TATTGAAATG TTCAAGAAGG AATTGGATGC 750 

TGCTATGGCT GGTGACAACT GTGGTGTCTT GTTGAGAGGT GTCAAGAGAG 800 

ACGAAATCAG CAGAGGTATG GTTTTGGCCA AGCCAGGTAC CGTCACTTCC 850 

CACAAGAAGT TCTTGGCCTC CATGTACATC TTGACTGGTG AAGAAGGTGG 900 

TCGTCGTACC CCATTCGGTG AAGGTTACAA GCCACAATGT TTCTTCAGAA 950 
CCAATGACAT CACCACCACT TTCACTTTCC CAGAAGGTGA AGGTGTCGAC . 1000 

CACTCCCAAA TGGTTATGCC AGGTGACAAC ATCGAAATGG TTGGTGAATT 1050 

GTACAAGGCT TGTCCTTGGA A 1071 



2) INFORMATION FOR SEQ ID NO: 1572 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acicl 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alcaligenes faecalis subsp. faecalis 

(B) STRAIN: ATCC 8750 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1572 

TATCTTGGTT WGCTCGGCCG CTGACGGCCC AATGCCTCAG ACTCGCGAGC 50 

ACATCCTGCT GAGCCGTCAG GTTGGCGTTC CTTACATCAT CGTGTTCCTG 100 

AACAAGGCCG ACATGGTTGA TGACGAAGAG CTGATCGAAC TGGTTGAAAT 150 

GGAAGTTCGC GAGCTGTTGT CCAAGTACGA CTTCCCTGGC GACGACACCC 2 00 

CGATCATCAA GGGTTCGGCC AAACTGGCTC TGGAAGGCGA CGAAGGCCCA 250 

CTGGGCAGCC AAGCCGTTCT GGCTCTGGCC GAAGCGCTGG ACAACTACAT 3 00 

813 
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TCCTACGCCT GAGCGTGCCG TTGACGGTAC 
ACGTGTTCTC GATCTCCGGC CGTGGTACGG 
CGCGGCATCA TCAAGGTCGG CGAAGAAATC 
CACGGTCAAG ACCATTTGTA CCGGCGTTGA 
5 ACCAGGGCGA AGCTGGCGAT AACGTCGGTC 
CGTGAAGACG TGGAACGTGG TCAAGTTCTG 
GCCACACACT GACTTCGACG CCGAGGTGTA 
GTGGTCGTCA CACTCCTTTC TTCAAGGGCT 
CGTACAACTG ACGTGACCGG CACCATCGAG 
10 GGTTCTGCCA GGCGACAACA TTTCGATGAA 
TCGCCATGGA AGAAGGT 
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GTTCCTGATG CCTGTTGAAG " 350 

TTGTGACCGG TCGTATTGAG 400 

GAAATCGTGG GTATCAAAGA 450 

AATGTTCCGC AAACTGCTGG 500 

TGCTGCTGCG TGGTACCAAG 550 

GCCAAGCCAG GCTCGATCAA 600 

CATTCTGTCC AAAGAAGAAG 650 

ACCGTCCTCA GTTCTACTTC 700 

CTGCCAGAAG ACAAGGAAAT 750 

AGTGTCCCTG ATCGCTCCTA 800 

817 



15 2) INFORMATION FOR SEQ ID NO: 1573 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Prevotella buccalis 

(B) STRAIN: ATCC 35310 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1573 



TATCCTTGTA GTTGCTGCTA CTGATGGTCC TATGCCACAG ACACGTGAGC 50 

ACGTGCTTTT GGCTCGTCAG GTAAACGTTC CTCGTTTGGT TGTGTTCATG 100 

AACAAGTGTG ACTTGGTAGA AGACGAAGAG ATGCTTGAAC TCGTTGAAAT 150 

GGAGTTGCGC GAACTTCTTG AGCAATACGA ATTCGAAGAG GATACTCCAA 200 

35 TCGTTCGTGG TTCTGCACTG GGTGCATTGA ATGGTGTTGA CAAGTGGGTT 250 

GACAGCGTGA TGACGTTGAT GGACACTGTT GACGAGTGGA TTCAAGAGCC 300 

AGAGCGTGAC CTTGACAAAC CTTTCTTGAT GCCAGTAGAG GACGTGTTCT 350 

CTATCACAGG TCGTGGTACC GTTGTAACAG GACGTATTGA GACTGGTAAG 400 

GTAAAGGTTG GCGACGAGAT TCAGTTGCTC GGTCTTGGTG AGGACAAGAA 450 

40 GTCTGTTGTA ACAGGCGTTG AAATGTTCCG TAAGATTCTT TCTGAAGGTG 500 

AAGCAGGTGA TAACGTAGGA CTTTTGCTCC GCGGTATCGA TAAGGATGAA 550 

GTAAAGCGTG GTATGGTTGT TGTACACCCA GGTGCCATCA CTCCTCACGA 600 

TCACTTCAAG GCTTCCATCT ATGTATTGAA GAAGGAAGAG GGTGGACGTC 650 

ATACTCCATT CGGAAACAAG TATCGTCCTC AGTTCTATCT CCGTACAATG 700 

45 GACTGTACAG GTGAGATCAC TTTGCCAGAA GGCGTAGAGA TGGTGATGCC 750 

TGGTGACAAC GTAGAGATTG AGGTTACCTT GATTTACAAG GTTGCC 796 



50 2) INFORMATION FOR SEQ ID NO: 1574 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 820 bases 
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SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Succinivibrio dextrinosolvens 

(B) STRAIN: ATCC 19716 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1574 

GCTATTCTAG TAGTAGCAGC AACTGATGGT CCTATGCCAC AGACCCGTGA 50 

GCACATCCTA TTAGCACGTC AGGTAGGCGT ACCATACATC ATCGTATTCC 100 

TAAACAAGTG CGATATGGTT GACGAC GAGG AATTATTAGA GTTAGTTGAG 150 

ATGGACGTAC GTGATCTATT AAATCAGTAC CAGTTCCCAG GCGACGACAC 200 

TCCAATCATC CGTGGTTCAG CACTAGGTGC ATTAAACGGC GAAGAGAAGT 250 

GGAAAGAGGC AATCTATCAG TTAGCAGACA CTCTAGATTC ATACATTCCA 300 

GAGCCAAAGC GTGATATCGA TGATCCATTC CTATTACCAA TCGAAGATAT 350 

CTTCTCAATC TCAGGTCGTG GTACTGTAGT AACCGGCCGT GTAGAGCGTG 400 

GTATTGTACA CGTAGGTGAC GAAGTTGAAA TCGTTGGTAT TCGTCCAACC 450 

ACCAAGACCA CTGTAACTGG CGTTGAAATG TTCCGTAAGT TACTAGACGA 500 

AGGTCGTGCA GGTGATAACG TTGGTGTTCT ACTACGTGGT ACCAAGCGTG 550 

ATGAGGTTGA GCGTGGTCAG GTTCTAGCTG CTCCAGGCAC AATCACTCCA 600 

CACACCAAGT TCACTGGTCA GGTTTACGTA CTAAGCAAGG ATGAAGGTGG 650 

TCGTCACACT CCATTCTTCA AGGGCTACCG TCCACAGTTC TTCTTCCGTA 700 

CAACCGATAT TACCGGT.TCT ATCGATCTGA AAGAGGGCGT AGAGATGGTA 750 
ATGCCAGGTG ATAACACCGA CATGACCGTA ACCCTAATCC ACCCAGTAGC ' 800 

TATGGCTGAA GGCGAGAGAT > 820 



2) INFORMATION FOR SEQ ID NO: 1575 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tetragenococcus halophilus 

(B) STRAIN: ATCC 33315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1575 

GGTAGTGTCT GCAGCTGACG GCCCAATGCC ACAAACTCGT GAACATATCC 50 

TATTGTCACG TAACGTTGGT GTACCATACA TCGTTGTATT CTTAAACAAA 100 

ATGGATATGG TTGACGATGA AGAATTACTT GAATTAGTTG AAATGGAAGT 150 

ACGTGATCTA TTAACTGAAT ACGACTTCCC AGGTGATGAT ACTCCTGTTA 200 

TTTCTGGTTC AGCTTTGAAA GCTTTAGAAG GCGACGAAGA ATATGAACAA 250 

815 
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AAAGTCTTAG 
ACGTGACCAT 
TCACTGGTCG 
AAAGTCGGTG 
GACAACTGTT 
AAGCTGGCGA 
ATCGAACGTG 
AAAATTCTCA 
ATACTCCATT 
GACATCACTG 
AGGTGATAAC 
AAA 



ACTTAATGGC 
GACAAACCGT 
TGGAACTGTT 
ATGAAGTTGA 
ACGGGTGTAG 
TAACATCGGT 
GTCAAGTGTT 
GCTGAAGTTT 
CTTCTCAAAC 
GTGTCATTGA 
GTAGCAATGG 



AGCTGTAGAT GATTTCATCC CAACTCCTGA 
TCATGATGCC GATTGAAGAT GTTTTCTCAA 
GCTACAGGTC GTGTTGAACG TGGAACTATT 
TATCATCGGT ATTCATGAAA ATGTTAAAAA 
AAATGTTCCG TAAATTGTTG GATTACGCTG 
ACATTATTGC GTGGTGTTTC TCGTGATGAT 
GGCTAAACCA GGCACAATCA CACCACATAC 
ATGTATTAAC AAAAGAAGAA GGCGGACGTC 
TATCGCCCAC AATTTTACTT CCGTACAACT 
ATTGCCAGAA GGTACTGAAA TGGTTATGCC 
AAGTTGAATT AATTCACCCT GTTGCTATTG 



300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
803 
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2) INFORMATION FOR SEQ ID NO: 1576 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni subp. jejuni 

(B) STRAIN: ATCC 33292 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1576 

TTAATGAAGC AATTGTTGTA AATTTTGAAA GCGAAGGAAA AAAACATAAA 50 

CTTGTTTTAG AAGTAGCAGC TCATTTAGGA GATAATAGAG TTAGAACTAT 100 

TGCTATGGAT ATGACAGATG GTTTGGTAAG GGGCTTAAAA GCTGAGGCTT 150 

TAGGTGCTCC TATTAGTGTT CCTGTTGGTG AGAAAGTTTT AGGAAGAATT 200 

TTCAATGTTA CTGGAGATTT GATCGATGAA GGTGAAGAAA TTTCTTTTGA 250 

TAAAAAATGG GCAATTCATA GAGATCCGCC AGCTTTTGAA GATCAAAGCA 300 

CAAAAAGTGA GATTTTTGAA ACAGGGATTA AAGTTGTAGA TTTGCTTGCT 350 

CCTTATGCAA AAGGTGGTAA AGTAGGTCTT TTTGGTGGTG CAGGTGTTGG 400 

TAAAACTGTT ATTATTATGG AGCTTATTCA CAATGTTGCA TTTAAGCATA 450 

GCGGCTATTC TGTATTTGCA GGTGTGGGTG AGAGAACTCG TGAAGGAAAT 500 

GACCTTTATA ATGAAATGAA AGAAAGTAAT GTTTTAGACA AAGTTGCTCT 550 

ATGTTATGGA CAAATGAATG AACCACCAGG AGCAAGAAAT CGTATTGCTT 600 

TAACAGGTTT AACAATGGCT GAGTATTTTA GAGATGAAAT GGGTCTTGAT 650 

GTGCTTATGT TTATTGATAA TATCTTTAGA TTTTCACAAT CAGGTTCTGA 700 

AATGTCAGCA CTTTTAGGAA GAATTCCATC AGCTGTGGGT TATCAACCAA 750 

CCCTAGCAAG TGAAATGGGT AAATTCCAAG AAAGAATTAC TTCAACTAAA 800 

AAAGG 805 



2) INFORMATION FOR SEQ ID NO: 1577 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: * 809 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter rectus 

(B) STRAIN: ATCC 33238 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1577 

ATTTACCCAA GATCAACGAA GCCGTCGAAG TAAATTTCGA GGTTGAGGGC 50 

AAGCAAAACA GACTTGTGTT AGAGGTTGCC GCACACCTTG GCGATAACCG 100 

CGTAAGAACG ATCGCTATGG ATATGAGCGA AGGCTTAACC AGAGGCCTTG 150 

AGGCTACGGC TCTTGGCGCG CCTATTAGCG TTCCGGTCGG CGAAAAAGTT 200 

TTGGGAAGGA TTTTTAACGT CGTCGGCGAT CTGATCGACG AGGGTGAAGG 250 

CATAGAATTT GATAAAAAAT GGTCTATCCA CCGCGATCCT CCGCCGTTTG 300 

AAGAGCAAAG CACGAAGAGT GAAATTTTTG AAACGGGTAT AAAAGTGGTC 350 

GATCTTCTAG CCCCTTATGC AAAAGGCGGT AAGGTCGGAT TATTCGGCGG 400 

TGCCGGCGTC GGTAAGACGG TTATTATCAT GGAGCTTATC CACAACGTTG 450 

CATTTAAGCA TAGCGGTTAT TCCGTGTTTG CCGGCGTGGG CGAGCGAACC 500 

CGCGAAGGAA ACGACCTTTA TCACGAGATG AAAGAGAGTA ACGTTTTGGA 550 

CAAAGTCGCC TTGTGCTACG GCCAGATGAA CGAGCCGCCG GGAGCAAGAA 600 

ACCGCATCGC TCTAACAGGC CTAACGATGG CTGAATACTT CCGCGACGAG 650, 

ATGGGACTTG ACGTTTTGAT GTTTATAGAC AACATCTTCC GTTTCTCTCA 700 

GTCTGGCGCT GAGATGTCGG CGCTTCTTGG ACGTATCCCG TCAGCCGTTG 7 50 

GTTATCAGCC GACTTTGGCG AGCGAAATGG GCAAATTCCA AGAGAGAATT 800 

ACATCAACC 809 



2) INFORMATION FOR SEQ ID NO: 1578 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1671 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus casseliflavus 

(B) STRAIN: ATCC 25788 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1578 

AATCGAAGTG CAACGTTCAT TGCGTGTACT TGATGGTGCT GTAACTGTGT 50 
TGGACTCACA ATCAGGTGTA GAACCTCAAA CAGAAACAGT TTGGCGTCAA 100 
GCGACAGATT ACCGCGTACC GCGTGTTGTA TTCTGTAACA AAATGGACAA 150 

817 
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AATTGGTGCA GACTTCTTAT ACTCTGTATC 
AAGCAAATGC TCACCCAATC CAATTACCAA 
ACTGGTATTA TCGACTTAGT AAAAATGAAA 
CTTAGGAACT GAAATCCAAG AGACTGAAAT 
TAGCTGAAGA ATGGCGCGAA AAATTAATTG 
GAAGAACTAA TGATGAAATT CTTGGAAGGT 
ATTGAAAGCT GGTATTCGTC AAGCAACATT 
TTCTTTGCGG ATCTGCCTTT AAAAACAAAG 
GCAGTCATCG ACTACTTGCC TTCACCACTT 
GATCAATCCT AAAACAGACG AAGAAACTGA 
CACCATTTGC TTCATTAGCA TTTAAAGTAA 
CGTTTGACAT TCTTYCGTGT GTATTCARGT 
CGTATTGAAT GCTTCAAAAG GCAAACGCGA 
AAATGCACGC CAACACTCGT GCTGAAATCC 
ATCGCCGCTG CTGTTGGTTT GAAAGACACA 
TGATGAAAAA TCCCCAGTAA TCCTTGAATC 
TTATCGAAGT CGCTGTTGAG CTTAAATCAA 
GGGGTTGCTT TACAAAAACT TGCTGAAGAA 
AMCAAACGCT GAAACAGGCG AAACTGTTAT 
ACTTGGACGT CTTAGTTGAC CGTATGCGTC 
AACGTAGGTG CGCCTCAAGT TTCTTATCGT 
ACAAGCGGAA GGTAAATTTG TACGTCAGTC 
GTCACGTATG GGTCGAATTT ACACCAAACG 
TTCGAAAACG CGATTGTCGG TGGTGTGGTT 
AGTTGAAAAA GGACTTGAAG AATCAATGGC 
ACCCATTAGT AGACATCAAA GCAAAACTTT 
GTCGATTCAA GTGAAACTGC CTTCCGTGTT 
TGCTGCAGCG AAGAAAGCAA ACCCAGTAAT 
TAGTTATCAC TGTACCAGAA GATTACTTAG 
ACTGCTCGTC GTGGACGCGT AGAAGGAATG 
AATCGTGAAC GCAATCGTGC C 
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AACTTTACAT GATCGTTTAC 200 

TTGGTGCGGA AGATGACTTT 250 

GCTGAAATCT ACACAAATGA 300 

TCCTGAAGAA TACGTAGAAT 350 

AAGCTGTTGC TGATACTGAT 400 

GAAGAAATCA CTGAAGAAGA 450 

GACTGTTGAC TTTTTCCCTG 500 

GGGTTCAATT GATGTTGGAT 550 

GATGTTCCTG CGATTAAAGG 600 

TCGTCCGGCT GACGATGAAG 650 

TGACTGACCC ATTCGTARGT 700 

ATCTTGAACT CTGGATCATA 750 

ACGTATCGGT CGGATCCTAC 800 

AAACAGTATA CTCAGGCGAT 850 

ACAACTGGGG ATCCACTATG 900 

AATCGAATTC CCAGAACCAG 950 

AAGCTGACCA AGATAAAATG 1000 

GATCCATCAT TCCGTGTGGA 1050 

CGCTGGTATG GGAGAACTTC 1100 

GCGAATTTAA AGTTGAAGCA 1150 

GAAACATTCC GTGCAGCAAC 1200 

TGGTGGTAAA GGTCAATACG 1250 

AAGAAGGTAA AGGCTTCGAA 1300 

CCTCGTGAAT ACATCCCAGC 1350 

GAACGGTGTC TTAGCCGGTT 1400 

ATGATGGTTC ATACCATGAT 1450 

GCAGCTTCTA TGGCTTTACG 1500,, 

TCTTGAACCA ATGATGAAAG 1550' 

GTGATGTTAT GGGTCACGTA 1600 

GAAGCACWCG GTAACTCACA 1650 

1671 



2) INFORMATION FOR SEQ ID NO: 1579 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1662 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus gallinarum 

(B) STRAIN: ATCC 49573 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1579 

GAAGTACAAC GTTCACTACG GGTTCTTGAC GGWGCTGTAA CAGTATTGGA 50 
CTCACAATCT GGTGTAGAAC CACAAACTGA AACAGTTTGG CGTCAAGCGA 100 
CAGATTACCG CGTACCACGT ATCGTATTCT GTAACAAAAT GGATAAAATC 150 
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GGTGCAGACT TCTTATACTC TGTATCTACT TTACATGATC GCTTGCAAGC 200 

AAATGCTCAT CCAATCCAAT TACCAATTGG TGCGGAAGAT GACTTTACTG 250 

GTATCATCGA TCTAGTAAAA ATGAAAGCTG AGATCTATAC AAACGATTTA 300 

GGAACAGAGA TTCAAGAAAC TGAAATTCCT GAAGAGTACA AAGAATTAGC 350 

5 TGAAGAATGG CGCGAAAAAT TAGTTGAAGC TGTTGCAGAT ACTGACGAAG 400 

AGCTAATGAT GAAATTCTTG GAAGGTGAAG AAATCACTGA AGAAGAATTG 4 50 

AAAGCTGGTA TCCGTCAAGC GACATTGACT GTTGAATTTT TCCCAGTTCT 500 

TTGTGGTTCA GCCTTCAAAA ACAAAGGGGT TCAATTGATG TTGGATGCAG 550 

TCATCGACTA CCTTCCTTCA CCACTTGATG TTCCTGCAAT CAAAGGGATC 600 

10 AATCCTAAAA CTGACGAAGA AACTGATCGT CCTGCTGACG ATGAAGCGCC 650 

TTTTGCTTCA CTAGCATTTA AAGTAATGAC TGACCCATTC GTAGGTCGTT 700 

TGACATTCTT CCGTGTGTAT TCAGGTGTCT TGAACTCTGG ATCATATGTC 750 

TTGAATGCTT CAAAAGACAA ACGCGAACGT ATCGGTCGTA TTCTGCAAAT 800 

GCACGCGAAC ACTTGTGCAG AAATCCAAAC AGTTTATTCA GGCGATATCG 850 

15 CTGCAGCTGT TGGTTTGAAA GATTCCACAA CAGGGGATAC ATTGTGTGCG 900 

AAAAATCACC CAGTAATCCT TGAATCAATC GAATTCCCAG AMCCAGTTAT 950 

CGAAGTAGCT GTTGAACYTA AATCAAAAGC TGACCAAGAT AAAATGGGTG 1000 

TGGCTTTACA AAAACTTGCT GAAGAAGATC CTTCATTCCG TGTAGAAMCA 1050 

AACGCTGAAA CTGGCGAAAC TGTTATCGCA GGGATGGGTG AACTTCACTT 1100 

20 GGACGTCCTT GTTGACCGTA TGCGTCGCGA ATTTAAAGTT GAAGCAAACG 1150 

TTGGTGCGCC TCAAGTTTCT TACCGCGAAA CATTCCGTGC TTCTACGCAA 1200 

GCCGAAGGTA AATTTGTACG TCAGTCTGGT GGTAAAGGTC AATACGGTCA 1250 

CGTATGGATC GAATTTACAC CAAACGAAGA AGGTAAAGGC TTCGAATTCG 1300 

AAAACGCAAT TGTCGGTGGT GTGGTTCCAC GTGAATACAT CCCAGCAGTT 1350 

25 GAAAAAGGAC TTGAAGACTC AATGGCTAAC GGTGTTCTAG CTGGTTATCC 1400 

ATTGGTTGAC ATCAAAGCCA AGCTTTACGA TGGTTCATAC CATGATGTCG 1450 

ATTCAAGTGA AACAGCCTTC CGTGTGGCAG CTTCAATGGC TTTACGTGCT 1500 

GCAGCGAAGA AAGCTAATCC AGTGATTCTT GAACCAATGA TGAAAGTTGT 1550 

TATCACTGTT CCTGAAGATT ACTTAGGTGA TATTATGGGA GACGTAACTG 1600 

30 CTCGTCGTGG ACGTGTTGAA GGTATGGAAG CGCATGGTAA CTCACAAATC 1650 

GTTAACGCGA TT 1662 



35 2 ) INFORMATION FOR SEQ ID NO: 1580 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1669 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1580 

CAATCGAAGT ACAACGTTCT CTTCGTGTAT TGGATGGTGC TGTTACCGTT 50 
CTTGACTCAC AATCAGGTGT TGAGCCTCAA ACTGAAACAG TTTGGCGTCA 100 
AGCAACTGAG TATGGAGTTC CACGTATCGT ATTTGCCAAC AAAATGGACA 150 
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AAATCGGTGC TGACTTCCTT TACTCTGTAA GCACACTTCA CGATCGTCTT 200 

CAAGCAAATG CACACCCAAT CCAATTGCCA ATCGGTTCTG AAGATGACTT 250 

CCGTGGTATC ATCGACTTGA TCAAGATGAA AGCTGAAATC TATACTAACG 300 

ACCTTGGTAC AGATATCCTT GAAGAAGACA TCCCAGCTGA ATACCTTGAC 350 

5 CAAGCTCAAG AATACCGTGA AAAATTGATC GAAGCAGTTG CTGAAACTGA 400 

CGAAGAATTG ATGATGAAAT ACCTCGAAGG TGAAGAAATC ACTAACGAAG 450 

AATTGAAAGC TGGTATCCGT AAAGCGACTA TCAACGTTGA ATTCTTCCCA 500 

GTATTGTGTG GCTCTGCCTT CAAGAACAAA GGTGTTCAAT TGATGCTTGA 550 

TGCGGTTATT GACTACCTTC CAAGCCCACT TGACATCCCA GCGATCAAAG 600 

10 GTATCAACCC AGATWCAGAT GAAGAAGAAA CTCGTCCAGC ATCTGATGAA 650 

GAGCCATTTG CAGCTCTTGC CTTCAAGATC ATGACAGACC CATTCGTAAG 700 

TCGTTTGACA TTCTTCCGTG TTTACTCARG TGKTCTTCAA TCAGGTTCAT 750 

ACGTATTGAA CACTTCTAAA GGKAAACGTG AGCGTATCGG ACGTATCCTT 800 

CAAATGCACG CCAACAGCCG TCAAGAAATT GACACTGGTT ACTCAAGAGA 850 

15 TATCGCTGCT GCCGTTGGTT TGAAAGATAC TCCAACTGGT GACTCATTGC 900 

CAGATGAAAA AGCTAAAATC ATTCTTGAGT CAATCAACGT TCCAGAMCCA 950 

GTTATCCAAT TGATGGTTGA GCCAAAATCT AAAGCTGACC AAGCCAAGAT 1000 

GGGTATCGCC CTTCAAAAAT TGGCTGAAGA AGATCCAACA TTCCGCGTTG 1050 

AAMCAAACGT TGAAACTGGT GAAMCAGTTA TCTCTGGTAT GGGTGAGCTT 1100 

20 CACCTTGACG TCCTTGTTGA CCGTATGCGT CGTGAGTTCA AAGTTGAAGC 1150 

AAACGTAGGT GCTCCTCAAG TATCTTACCG TGAAACATTC CGCGCTTCTA 1200 

CTCAAGCACG TGGATTCTTC AAACGTCAGT CTGGTGGTAA AGGTCAATTC 1250 

GGTGATGTAT GGATTGAATT TACTCCAAAC GAAGAAGGTA AAGGATTCGA 1300 

ATTCGAAAAC GCAATCGTCG GTGGTGTGGT TCCTCGTGAA TTTATCCCAG 1350 

25 CGGTTGAAAA AGGTTTGGTA GAATCTATGG CTAACGGTGT ACTTGCAGGT 1400 

TACCCAATGG TTGACGTTAA AGCTAAGCTT TACGATGGTT CATACCACGA 1450 

TGTCGACTCA TCTGAAACTG CCTTCAAGAT TGCGGCTTCA CTTGCTCTTA 1500 

AAGAAGCTGC TAAATCAGCA CAACCAGCTA TCCTTGAGCC AATGATGCTT 1550 

GTAACAATCA CTGTTCCAGA AGAAAACCTT GGTGATGTTA T.GGGTCACGT 1600 

30 AACTGCTCGT CGTGGACGTG TAGATGGTAT GGAAGCACWC GGTAACAGCC 1650 

AAATCGTTCG TGCTTACGT 1669 



35 2) INFORMATION FOR SEQ ID NO: 1581 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS : Single 

(D) ' TOPOLOGY: Linear 

^(ii) MOLECULE TYPE: DNA 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1581 

AATTGGGGAC TACACCTATT ATGATG 26 



50 



2) INFORMATION FOR SEQ ID NO: 1582 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1582 
GGCAAATCAG TCAGTTCAGG AGT 



2) INFORMATION FOR SEQ ID NO: 1583 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1583 
CGATTGGCAA CAATACACTC CTG 



2) INFORMATION FOR SEQ ID NO: 1584 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1584 
TCACCTATTT TTACGCCTGG TAGGAC 



2) INFORMATION FOR SEQ ID NO: 1585 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 645 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 
5 (C) ACCESSION NUMBER: AF139725 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1585 

ATGACTATAC CTGACGCAAA TGCAATCTAT CCTAACTCAG CCATCAAAGA 50 

10 GGTTGTCTTT ATCAAGAACG TGATCAAAAG TCCCAATATT GAAATTGGGG 100 

ACTACACCTA TTATGATGAC CCAGTAAATC CCACCGATTT TGAGAAACAC 150 

GTTACCCATC ACTATGAATT TCTAGGCGAC AAATTAATCA TCGGTAAATT 200 

TTGTTCTATC GCCAGTGGCA TTGAATTTAT CATGAACGGT GCCAACCACG 250 

TAATGAAAGG TATTTCGACT TATCCATTTA ATATTTTAGG TGGCGATTGG 300 

15 CAACAATACA CTCCTGAACT GACTGATTTG CCGTTGAAAG GTGATACTGT 350 

AGTCGGAAAT GACGTGTGGT TTGGGCAAAA TGTGACCGTC CTACCAGGCG 400 

TAAAAATAGG TGACGGTGCC ATTATCGGAG CAAATAGTGT TGTAACAAAA 450 

GACGTCGCTC CATATACAAT TGTCGGTGGC AATCCAATTC AACTCATCGG 500 

ACCAAGATTT GAACCGGAAG TTATTCAAGC ATTAGAAAAT CTGGCATGGT 550 

20 GGAATAAAGA TATTGAATGG ATAACTGCTA ATGTTCCTAA ACTAATGCAA 600 

ACAACACCCA CACTTGAATT GATAAACAGT TTAATGGAAA AATAA 645 



25 2) INFORMATION FOR SEQ ID NO: 1586 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1586 

ATTCCCACAA TCTTTTTTAT C AATAA 26 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1587 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1587 

CATTGTTCAG ATTCGGTAAA GTTC 24 
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2) INFORMATION FOR SEQ ID NO: 1588 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1588 
GTTTTTGAAG TTAAATAGTG TTCTT 25 



2) INFORMATION FOR SEQ ID NO: 1589 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO,: 1589 
CTTCCATTTG TACTTTCCGT A 21 



2) INFORMATION FOR SEQ ID NO: 1590 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1920 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cloning vector pFW16 
(C) ACCESSION NUMBER: U50983 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1590 

ATGAAAATTA TTAATATTGG AGTTTTAGCT CATGTTGATG CAGGAAAAAC 50 

TACCTTAACA GAAAGCTTAT TATATAACAG TGGAGCGATT ACAGAATTAG 100 

GAAGCGTGGA CAAAGGTACA ACGAGGACGG ATAATACGCT TTTAGAACGT 150 

CAGAGAGGAA TTACAATTCA GACAGGAATA ACCTCTTTTC AGTGGGAAAA 200 

TACGAAGGTG AACATCATAG ACACGCCAGG ACATATGGAT TTCTT AGCAG 250 
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AAGTATATCG TTCATTATCA GTTTTAGATG GGGCAATTCT ACTGATTTCT 300 

GCAAAAGATG GCGTACAAGC ACAAACTCGT ATATTATTTC ATGCACTTAG 350 

GAAAATGGGG ATTCCCACAA TCTTTTTTAT CAATAAGATT GACCAAAATG 400 

GAATTGATTT ATCAACGGTT TATCAGGATA TTAAAGAGAA ACTTTCTGCC 4 50 

5 GAAATTGTAA TCAAACAGAA GGTAGAACTG TATCCTAATG TGTGTGTGAC 500 

GAACTTTACC GAATCTGAAC AATGGGATAC GGTAATAGAG GGAAACGATG 550 

ACCTTTTAGA GAAATATATG TCCGGTAAAT CATTAGAAGC ATTGGAACTC 600 

GAACAAGAGG AAAGCATAAG ATTTCAGAAT TGTTCTCTGT TCCCTCTTTA 650 

TCATGGAAGT GCAAAAAGTA ATATAGGGAT TGATAACCTT ATAGAAGTTA 700 

10 TTACTAATAA ATTTTATTCA TCAACACATC GAGGTCCGTC TGAACTTTGC 750 

GGAAATGTTT TCAAAATTGA ATATACAAAA AAAAGACAAC GTCTTGCATA 800 

TATACGCCTT TATAGTGGAG TACTACATTT ACGAGATTCG GTTAGAGTAT 850 

CAGAAAAGGA AAAAATAAAA GTTACAGAAA TGTATACTTC AATAAATGGT 900 

GAATTATGTA AGATTGATAG AGCTTATTCT GGAGAAATTG TTATTTTGCA 950 

15 AAATGAGTTT TTGAAGTTAA ATAGTGTTCT TGGAGATACA AAACTATTGC 1000 

CACAGAGAAA AAAGATTGAA AATCCGCACC CTCTACTACA AACAACTGTT 1050 

GAACCGAGTA AACCTGAACA GAGAGAAATG TTGCTTGATG CCCTTTTGGA 1100 

AATCTCAGAT AGTGATCCGC TTCTACGATA TTACGTGGAT TCTACGACAC 1150 

ATGAAATTAT ACTTTCTTTC TTAGGGAAAG TACAAATGGA AGTGATTAGT 1200 

20 GCACTGTTGC AAGAAAAGTA TCATGTGGAG ATAGAAATAA CAGAGCCTAC 1250 

AGTCATTTAT ATGGAGAGAC CGTTAAAAAA TGCAGAATAT ACCATTCACA 1300 

TCGAAGTGCC GCCAAATCCT TTCTGGGCTT CCATTGGTCT ATCTGTATCA 1350 

CCGCTTCCGT TGGGAAGTGG AATGCAGTAT GAGAGCTCGG TTTCTCTTGG 1400 

ATACTTAAAT CAATCATTTC AAAATGCAGT TATGGAAGGG ATACGCTATG 1450 

25 GTTGTGAACA AGGATTGTAT GGTTGGAATG TGACGGACTG TAAAATCTGT 1500 

TTTAAGTATG GCTTATACTA TAGCCCTGTT AGTACCCCAG CAGATTTTCG 1550 

GATGCTTGCT CCTATTGTAT TGGAACAAGT CTTAAAAAAA GCTGGAACAG 1600 

AATTGTTAGA GCCATATCTT AGTTTTAAAA TTTATGCGCC ACAGGAATAT 1650 

CTTTCACGAG CATACAACGA TGCTCCTAAA TATTGTGCGA ACATCGTAGA 1700 

30 CACTCAATTG AAAAATAATG AGGTCATTCT TAGTGGAGAA ATCCCTGCTC 1750 

GGTGTATTCA AGAATATCGT AGTGATTTAA CTTTCTTTAC AAATGGACGT 1800 

AGTGTTTGTT TAACAGAGTT AAAAGGGTAC CATGTTACTA CCGGTGAACC 1850 

TGTTTGCCAG CCCCGTCGTC CAAATAGTCG GATAGATAAA GTACGATATA 1900 

TGTTCAATAA AATAACTTAG 1920 

35 



2) INFORMATION FOR SEQ ID NO: 1591 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1591 
50 ATGAGGTAAT AGAACGGATT 20 



2) INFORMATION FOR SEQ ID NO: 1592 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1592 
CAGTATTTCA GTAAGCGTAA A 



2) INFORMATION FOR SEQ ID NO: 1593 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1593 
CCGAGCGATT TACCGGATAC TTGGCTGCIC GCTCGG 



2) INFORMATION FOR SEQ ID NO: 1594 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: N97-330 

(C) ACCESSION NUMBER: AF175293 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1594 

ATGTTTAGAA TTAAAGTTGC AGTTCTGTTT GGGGGCTGTT CAGAGGAACA 50 

TAATGTTTCG ATAAAATCTG CGATGGAGAT TGCCGCAAAC ATAGATACAA 100 

AAAAGTATCA GCCTTATTAT ATTGGAATCA CAAAATCCGG CGTTTGGAAA 150 

ATGTGTGAAA AACCTTGTTT GGAGTGGGAA CAATATGCGG GGGATCCGGT 20 0 

TGTTTTTTCA CCGGACAGAA -GTACGCATGG TCTGCTGATA CAAAAAGACA 250 
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CTGGGTATGA 
AAGTTTGGCG 
TCCGTATGTG 
CGCTTGCATA 
CGGATCCTTC 
CGTTTTTGTA 
TATGCAAGGC 
GACAGCAAGA 
CGCCATACTG 
TTGAGCTGAG 
AAAGGATCTG 
GGTAATAGAA 
GCTGCAGAGG 
ATTGTGTTGA 
TTATCCCCGC 
ATCGCTTGAT 



AATCCAGCCT 
AGGATGGCTC 
GGATGCGATA 
TACCGTTGTG 
AGGAGGGGGA 
AAGCCTGCCC 
AGAAGAACTG 
TTTTGATTGA 
GGAAACGGAA 
ACACGGCTTT 
AAAATGCAGT 
CGGATTCAGA 
ATTGGCCCGC 
ATGAAGTGAA 
ATGATGACAG 
TGAACTTTCA 



GTGGATGTGG 
CATACAAGGC 
TTCAAAGCTC 
AAAAATGCGG 
TCGCCTGGAA 
GTTCCGGCTC 
CAGGCAGCAA 
AGAGGCCGTT 
ATGATCTCAT 
TTTAAGATTC 
CATCCGAGTT 
AAACAGCAAT 
ATTGACCTGT 
TACCATGCCG 
CAGCCGGTTT 
CTTAGGAGGT 



GATTACCGAT GATTCATGGC 
TTGCTTGAAT TGTCAGGCAT 
CGTGACCTGC ATGGATAAGG 
GTATCGCTGT GCCTGGGTTC 
ACAGAGGATT TCGTATATCC 
ATCCTTTGGC GTAAACAAGG 
TCGAAGATGC AAGAAAATAT 
ACCGGGAGTG AGGTAGGCTG 
GGCTGGCGAG GTGGATCAGA 
ATCAGGAAGC ACAGCCGGAG 
CCAGCCGCCT TACCGGATGA 
GAAGATTTAC CGGATACTTG 
TTTTGCGGGA GGACGGCTGC 
GGTTTTACTT CCTACAGCCG 
TACGCTTACT GAAATACTGG 
AA 



300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1032 



2) INFORMATION FOR SEQ ID NO: 1595 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1595 
AAATAATGCT CCATCAATTT GCTGA 



2) INFORMATION FOR SEQ ID NO: 1596 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1596 
ATAGTCGAAA AAGCCATCCA CAAG 



2) INFORMATION FOR SEQ ID NO: 1597 
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30 



40 



45 



WO 01/23604 PCT/CA00/01 150 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1597 
GATGAATTTG CGAAAATACA TGGA 24 



15 2) INFORMATION FOR SEQ ID NO: 1598 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1598 

CAGCCAATTT CTACCCCTTT CAC 23 



2) INFORMATION FOR SEQ ID NO: 1599 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 604 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: BM4405 

(C) ACCESSION NUMBER: AF136925 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1599 



CAGGGAGTAT TTGAGTTATT AGATATTCCA TATGTAGGTT GTGGTATCGG 50 

AGCTGCAGCA ATTTCCATGA ATAAAATAAT GCTCCATCAA TTTGCTGAAA 100 

50 TAGTTGGTGT AAAAAGCACT CCTAGTATGA TTATAGAAAA GGGACAAGAC 150 

CTACAAAAAG TCGATGAATT TGCGAAAATA CATGGATTTC CTTTATATAT 200 

CAAACCGAAT GAGGCAGGCT CATCAAAAGG AATTAGCAAG GTAGAACAAA 250 

AAAGTGATTT ATATAAAGCA ATAGACGAAG CTTCAAAATA TGATAGCCGT 300 
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ATTTTAATTC AAAAGGAAGT 
AGGGAATGAA CAATTGGTCG 
ATGGCTTTTT CGACTATGAA 
TTGTTACCAG CTAAACTATC 
AGCAAAAAAA CTATACAGAC 
ACTTTTTCTT AACGGATGAC 
CTCC 



GAAAGGGGTA GAAATTGGCT GTGGGATTTT 
TTGGAGAATG TGATCAAATT AGTCTTGTGG 
GAGAAATACA ATTTAGTAAC AGCAGAAATT 
AATAGACAAA AAAGAAGACA TCCAGATAAA 
TATTAGGGTG CAAAGGATTA GCGAGAATCG 
GGAGAAATTT TATTAAATGA GATCAACACC 



350 
400 
450 
500 
550 
600 
604 



2) INFORMATION FOR SEQ ID NO: 1600 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni subsp. doylei 

(B) STRAIN: ATCC 49349 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1600 

ACCTCAAATT AATGAAGCAA TTGTTGTAAA TTTTGAAAGC GAAGGAAAAA 50 

AACATAAACT TGTTTTAGAA GTAGCAGCTC ATTTAGGAGA TAATAGAGTT 100 

AGAACTATTG CTATGGATAT GACAGATGGT TTGGTAAGGG GCTTAAAAGC 150. 

TGAGGCTTTA GGTGCTCCTA TTAGTGTTCC TGTTGGTGAG AAAGTTTTAG 200 

GAAGAATTTT CAATGTTACT GGAGATTTGA TCGATGAAGG TGAAGAAATT 250 

CCTTTTGATA AAAAATGGGC AATTCATAGA GATCCGCCAG CTTTTGAAGA 300 

TCAAAGCACA AAAAGTGAGA TTTTTGAAAC AGGGATTAAA GTTGTAGATT 350 

TACTTGCTCC TTATGCAAAA GGTGGTAAAG TAGGTCTTTT TGGTGGTGCA 400 

GGTGTTGGTA AAACTGTTAT TATTATGGAG CTTATTCACA ATGTTGCATT 450 

TAAGCATAGC GGCTATTCTG TATTTGCAGG TGTGGGTGAG AGAACTCGTG 500 

AAGGAAATGA CCTTTATAAT GAAATGAAAG AAAGTAATGT TTTAGACAAA 550 

GTTGCTCTAT GTTATGGACA AATGAATGAA CCACCAGGAG CAAGAAATCG 600 

TATTGCTTTA ACAGGTTTAA CAATGGCTGA GTATTTTAGA GATGAAATGG 650 

GTCTTGATGT GCTTATGTTT ATTGATAATA TCTTTAGATT TTCACAATCA 700 

GGTTCTGAAA TGTCAGCACT TTTAGGAAGA ATTCCATCAG CTGTGGGTTA 750 

TCAACCAACC CTAGCAAGTG AAATGGGTAA ATTCCAAGAA AGAATTACTT 800 

CAACT 805 



2) INFORMATION FOR SEQ ID NO: 1601 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus sulfureus 

(B) STRAIN: ATCC 49903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1601 

CTCCTTACCA GATATCAACA ATGCGCTGGT TGTATATAAA AATGATGCCA 50 

ATAAAACAAA AGTTGTTTTA GAAGCAGCGT TAGAATTAGG TGACGGAATC 100 

ATTCGTGCGA TTGCAATGGA ATCGACTGAT GGATTGCAAC GTGGGATGGA 150 

AGTTGTCGAT ATGGGAGAAT CTATTTCTGT ACCAGTTGGA ACAGAAACTT 200 

TAGGACGTGT GTTTAATGTA TTAGGAGATA CGATTGACTT AGAAGCTCCT 250 

TTTCCTAAAG ATGCACCGCG TAGTGGAATC CATAAAAAAG CCCCTAATTT 3 00 

TGATGAATTA AGTACAAGCA CAGAAATTCT AGAAACAGGG ATCAAAGTCA 350 

TCGATCTATT AGCCCCTTAT TTAAAAGGTG GTAAAGTTGG ATTGTTTGGG 400 

GGAGCCGGTG TAGGAAAAAC GGTATTGATT CAAGAGTTGA TTCACAATAT 450 

CGCACAAGAA CACGGTGGGA TTTCGGTCTT TACTGGTGTT GGTGAGCGTA 500 

CACGTGAAGG AAACGATTTG TATTATGAAA TGAAAGATTC AGGAGTCATT 550 

GAAAAAACAG CGATGGTGTT TGGACAAATG AATGAGCCAC CTGGTGCACG 600 

GATGCGTGTA GCCTTGACTG GATTAACGAT TGCTGAATAT TTCCGTGATG 650 

TTGAAGGACA GGATGTACTG TTGTTTATCG ACAATATCTT CCGTTTTACA 700 

CAAGCTGGTT CTGAAGTGTC TGCCTTGTTA GGTCGTATGC CATCAGCTGT 750 

GGGATATCAA CCAACATTAG CTACCGAAAT GGGGCAATTG CAAGAGCGGA 800 

TCACGTCAAC GAAAAAAGGA TCGATC 826 



2) INFORMATION FOR SEQ ID NO: 1602 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus solitarius 

(B) STRAIN: ATCC 49428 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1602 

TGATACTTTA CCAGATATTA ATAATGCATT AGTAGTATAT AAAAAGGACG 50 

AGGACAAGAC ACGCGTTGTC TTAGAAGCCA CCTTGGAACT TGGAGATGGC 100 

ATGATTCGTG CAATCTCTAT GGGATCGACT GATGGCTTGC AACGGGGAAT 150 

GGAAGTTGTG GACACACAAG CACCTATTTC TGTT CCAGT A GGAAATGAAA 2 00 

CCTTAGGACG TGTTTTTAAT GTCTTAGGAG AAACGATTGA TAAACAGGCA 250 

CCGTTTCCTG AAGATGCCAA AAAAAGTGGT ATTCATAAAA AAGCACCCGC 3 00 

TTTTGATGAA TTAAGTACCA GTTCTGAAAT ATTGGAAACC GGGATTAAAG 350 

TAATCGATTT GCTAGCTCCT TATTTACGAG GTGGTAAAGT TGGATTATTT 400 
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GGCGGTGCTG GCGTGGGTAA 
CGTTGCCCAA GAACATGGGG 
GTACTCGTGA AGGAAATGAC 
ATTGAAAAAA CGGCTATGGT 
5 ACGTATGCGT GTAGCGTTAA 
ATGTACAAGG TCAAGACGTA 
ACTCAAGCAG GAACAGAAGT 
CGTTGGTTAC CAACCAACTC 
GAATCACATC GACAGATAAA 



AACGGTATTA ATTCAAGAAT 
GAATTTCTGT TTTTACGGGT 
CTATATTATG AAATGCAGGA 
ATTTGGACAA ATGAACGAAC 
CTGGTTTGAC ACTTGCTGAG 
TTATTATTTA TAGATAATAT 
ATCTGCTTTA TTAGGACGGA 
TAGCAACGGA AATGGGACAG 
GGATCAATTA CCT 
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TAATTCATAA 


450 


GTCGGAGAGC 


500 


TTCAGGCGTT 


550 


p ppp p tp p 




TACTTCCGTG 


650 


TTTCCGCTTT 


700 


TGCCGTCTGC 


750 


TTGCAAGAAC 


800 




833 



2) INFORMATION FOR SEQ ID NO: 1603 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter sputorum subsp. sputorum 
25 (B) STRAIN: ATCC 35980 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1603 

ATCAGCCTAA AATTAACGAA GCAATAGAAG TTAATTATGA ATTAGATGGT 50 

30 AAAAAAATAA GACTTATTCT TGAAGTAGCT GGACATCTTG GCGATAATAG 100 

AGCAAGAACC ATTGCTATGG ATATGAGTGA TGGTTTACAA AGAGGATTAG 150 

AAGTTACGGC TCTTGGTGCT CCTATAACAG TTCCTGTTGG AGATAAAGTT 200 

TTAGGTAGAA TGTTTAATGT TGTAGGTGAC TTAATAGATG AAGGTGAAGT 250 

AACAGATTTT GATAAAAGAT GGGCTATCCA TAGAGATCCT CCTTCGTTTG 300 

35 AAGATCAAAG TACAAAAAGT GAAATTTTTG AAACAGGTAT AAAAGTAGTT 350 

GATCTTCTTG CACCTTATGC AAAAGGTGGT AAAGTTGGCT TATTTGGTGG 400 

TGCTGGCGTT GGAAAAACAG TTATCATAAT GGAGCTTATA CATAATGTTG 450 

CATTTAAACA CAGCGGTTAT TCAATTTTTG CCGGTGTTGG AGAGAGAACA 500 

AGAGAGGGAA ATGATCTTTA TAATGAGATG AAAGAGTCTG GTGTTTTGGA 550 

40 TAAAGTTGCC TTATGTTATG GACAAATGAA TGAACCACCA GGAGCAAGAA 600 

ACCGTATAGC ATTAACAGGT CTTACAATGG CTGAATATTT CCGTGATGAA 650 

ATGGGGCTTG ATGTGTTGAT GTTTATAGAT AATATTTTTA GATTTTCTCA 700 

ATCAGGTTCT GAAATGTCAG CGCTGCTTGG TAGAATTCCA TCTGCTGTTG 750 

GTTATCAACC AACATTAGCA AGTGAGATGG GAAAACTTCA AGAAAGAATT 800 

45 ACTTCC 806 



2) INFORMATION FOR SEQ ID NO: 1604 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus pseudoavium 

(B) STRAIN: ATCC 49372 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1604 

GGTGTTATCC GCACAATCGC TATGGAATCT ACAGATGGAT TGCAACGGGG 50 

GATGGAAGTT GTCGATACCG GCAAACCAAT CTCTGTTCCT GTAGGTAAAG 100 

AAACATTAGG TCGTGTGTTT AACGTATTAG GTGAAACGAT CGACAAAGAA 150 

GCACCTTTTC CAGAAGATGT AGAAAAGAGC GGTATTCACA AAAAGGCCCC 200 

CGCTTTTGAA GACCTTAGCA CCAGTAATGA GATTTTAGAA ACTGGGATCA 250 

AGGTTATCGA CTTATTAGCC CCTTACTTAA AAGGTGGTAA AGTTGGACTA 300 

TTCGGTGGTG CCGGTGTTGG TAAAACCGTC TTAATTCAAG AACTGATTCA 350 

TAATATCGCC CAAGAACACG GTGGGATTTC TGTCTTTACC GGGGTTGGGG 400 

AACGGACTCG TGAAGGGAAC GACCTTTATT ATGAAATGAA AGAATCCGGC 450 

GTTATTGAAA AAACAGCGAT GGTCTTCGGA CAAATGAATG AGCCACCAGG 500 

TGCGCGGATG CGCGTTGCCT TGACTGGTTT GACATTAGCT GAATATTTCC 550 

GTGATGAAGA AGGTCAAGAT GTGTTGCTAT TTATCGATAA CATTTTCCGC 600 

TTCACACAAG CCGGATCAGA AGTTTCGGCG CTATTAGGTC GGATGCCATC 650 

TGCCGTTGGT TATCAACCAA CCTTGGCAAC AGAAATGGGT CAATTACAAG 700 

AACGAATCAC TTCAACGAAA AAAGGCTCAA TTACATCG 738 



2) INFORMATION FOR SEQ ID NO: 1605 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1605 
ATYATYGAAR TITAYGCICC 



2) INFORMATION FOR SEQ ID NO: 1606 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1606 
CCRAACATIA YICCIACTTT TTC 23 



2) INFORMATION FOR SEQ ID NO: 1607 



(i) SEQUENCE CHARACTERISTICS: 
•(A) LENGTH: 336 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Klebsiella ornithinolytica 

(B) STRAIN: ATCC 31898 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1607 

25 CTGGATTATG CGATGTCGGT CATTGTTGGC CGTGCGCTGC CGGATGTCCG 50 

AGATGGCCTG AAACCGGTAC ACCGTCGCGT ACTTTACGCC ATGAACGTAT 100 

TGGGCAATGA CTGGAACAAA GCCTATAAAA AATCCGCCCG TGTCGTTGGT 150 

GACGTAATCG GTAAATACCA CCCTCATGGT GATACCGCCG TTTATGACAC 200 

CATTGTACGT ATGGCACAGC CATTCTCCTT . GCGTTATATG CTGGTCGATG 250 

30 GCCAGGGTAA CTTCGGTTCT GTCGATGGCG ACTCCGCCGC AGCGATGCGT 300 

TATACGGAAA TCCGTATGTC GAAAATCGCC CACGAG 336 



35 2) INFORMATION FOR SEQ ID NO: 1608 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 341 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 13182 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1608 

CTATCTGGAT TATGCGATGT CGGTCATTGT TGGCCGTGCG CTGCCGGATG 50 
TCCGAGATGG CCTGAAGCCG GTACACCGTC GCGTACTATA CGCCATGAAC 100 
GTATTGGGCA ATGACTGGAA CAAAGCCTAT AAAAAATCTG CCCGTGTCGT 150 
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GGGTGACGTC ATCGGTAAAT ACCACCCTCA TGGTGATACT GCCGTATACG 

ACACCATTGT ACGTATGGCG CAGCCATTCT CCCTGCGTTA CATGCTGGTA 

GATGGCCAGG GTAACTTTGG TTCGGTCGAC GGCGACTCCG CCGCAGCGAT 

GCGTTATACG GAAATCCGTA TGTCGAAGAT CGCCCATGAA C 



200 
250 
300 
341 



2) INFORMATION FOR SEQ ID NO: 1609 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1609 
GCCCTGATCC AAATAGCATA TA 22 



2) INFORMATION FOR SEQ ID NO: 1610 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1610 
CCTGGCATAA CAGTAACATT CTG 23 



2) INFORMATION FOR SEQ ID NO: 1611 

(i) SEQUENCE CHARACTERISTICS: 
* (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1611 
TGGGAAAAAG CAACTCCATC TC 22 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1612 
ACAACTGAAT TCGCAGCAAC AAT 



2) INFORMATION FOR SEQ ID NO: 1613 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 639 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

MOLECULE TYPE: Genomic DNA 
ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: U19459 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1613 

ATGAAATATG GCCCTGATCC AAATAGCATA TATCCACATG AAGAAATAAA 50 

AAGTGTTTGT TTTATTAAAA AT AC AAT T AC CAATCCAAAT ATTATAGTTG 100 

GAGATTATAC TTACTATTCC GATGTTAACG GAGCTGAAAA ATTTGAAGAA 150 

CATGTGACAC ATCATTATGA ATTTAGGGGT GATAAACTTG TAATTGGCAA 200 

GTTTTGTGCA ATAGCTGAAG GTATAGAATT TATTATGAAT GGAGCAAACC 250 

ATAGAATGAA TTCAATAACA ACTTATCCTT TT AAT AT AAT GGGAAATGGT 300 

TGGGAAAAAG CAACTCCATC TCTTGAAGAT TTACCATTTA AGGGAGATAC 350 

TGTTGTTGGA AATGATGTGT GGATTGGTCA GAATGTTACT GTTATGCCAG 400 

GAATTCAAAT AGGAGATGGA GCAATTGTTG CTGCGAATTC AGTTGTTACA 450 

AAAGATGTAC CACCATATCG TATTATTGGT GG AAATCCGA GTAGAATTAT 500 

AAAGAAAAGG TTTGAAGATG AATTGATAGA TTATTTATTG CAAATAAAAT 550 

GGTGGGATTG GTCAGCACAA AAAATATTTT CTAATCTTGA AACACTTTGT 600 

AGCTCTGATT TAGAGAAAAT AAAATCTATT CGAGATTAG 639 



(ii) 
(vi) 



2) INFORMATION FOR SEQ ID NO: 1614 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1614 
CCAATCCAGA AGAAATATAC CC 22 



2) INFORMATION FOR SEQ ID NO: 1615 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1615 
ATTAGTTTAT CCCCAATCAA TTCA 24 

25 

2) INFORMATION FOR SEQ ID NO: 1616 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1616 
40 ATAATGAATG GGGCTAATCA TCGTAT 26 



2) INFORMATION FOR SEQ ID NO: 1617 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1617 



GCCAACAACT GAATAAGGAT CAAC 24 



2) INFORMATION FOR SEQ ID NO: 1618 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 639 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus cohnii 
(C) ACCESSION NUMBER: AF015628 



20 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1618 



ATGAAATGGC AAAATCAGCA AGGCCCCAAT CCAGAAGAAA TATACCCTAT 50 

AGAAGGTAAT AAACATGTTC AATTTATTAA ACCATCTATA ACAAAGCCCA 100 

25 ATATTTTAGT TGGGGAATAT TCATATTACG ATAGTAAAGA TGGTGAATCT 150 

TTTGAAAGCC AAGTTCTTTA TCACTATGAA TTGATTGGGG ATAAACTAAT 200 

ATTAGGGAAG TTTTGTTCTA TTGGACCCGG AACGACATTT ATAATGAATG 250 

GGGCTAATCA TCGTATGGAT GGTTCAACAT TTCCATTCAA TCTTTTCGGA 300 

AATGGTTGGG AGAAGCATAC CCCTACATTG GAAGACCTTC CTTATAAGGG 350 

3 0 TAACACGGAA ATTGGGAACG ATGTTTGGAT TGGACGAGAT GTGACAATTA 400 

TGCCCGGTGT AAAAATAGGA AACGGGGCTA TTATTGCAGC AAAATCGGTT 450 

GTGACAAAGA ACGTTGATCC TTATTCAGTT GTTGGCGGTA ATCCTTCACG 500 

ATTAATTAAG ATAAGGTTTT CCAAGGAAAA AATCGCAGCA TTACTAAAAG 550 

TAAGGTGGTG GGACCTAGAG ATAGAGACGA TAAATGAAAA TATTGATTGC 600 

35 ATCCTGAATG GTGATATAAA AAAGGTTAAA AGAAGTTAG 639 



2) INFORMATION FOR SEQ ID NO: 1619 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1619 

50 

AAGGCAAAAT AAAAGGAGCA AAGC 24 
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2) INFORMATION FOR SEQ ID NO: 1620 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1620 
TGTACCCGAG ACATCTTCAC CAC 



23 



15 



20 



25 



2) INFORMATION FOR SEQ ID NO: 1621 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1621 
AATTGAAGGA CGGGTATTGT GGAAAG 



26 



30 



35 



40 



2) INFORMATION FOR SEQ ID NO: 1622 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1622 



45 CGATTTTGAC AGATGGCGAT AATGAA 



26 



2) INFORMATION FOR SEQ ID NO: 1623 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1569 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY Linear 



(ii) 



MOLECULE TYPE: Genomic DNA 



5 



(vi) 



ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: M90056 



10 



(xi) 



SEQUENCE DESCRIPTION: SEQ 



ID NO: 1623 



ATGAAAATAA TGTTAGAGGG ACTTAATATA AAACATTATG TTCAAGATCG 
TTTATTGTTG AACATAAATC GCCTAAAGAT TTATCAGAAT GATCGTATTG 
GTTTAATTGG TAAAAATGGA AGTGGAAAAA CAACGTTACT TCACATATTA 

15 TATAAAAAAA TTGTGCCTGA AGAAGGTATT GTAAAACAAT TTTCACATTG 
TGAACTTATT CCTCAATTGA AGCTCATAGA ATCAACTAAA AGTGGTGGTG 
AAGTAACACG AAACTATATT CGGCAAGCGC TTGATAAAAA TCCAGAACTG 
CTATTAGCAG ATGAACCAAC AACTAACTTA GATAATAACT ATATAGAAAA 
ATTAGAACAG GATTTAAAAA ATTGGCATGG AGCATTTATT ATAGTTTCAC 

20 ATGATCGCGC TTTTTTAGAT AACTTGTGTA CTACTATATG GGAAATTGAC 
GAGGGAAGAA TAACTGAATA TAAGGGGAAT TATAGTAACT ATGTTGAACA 
AAAAGAATTA GAAAGACATC GAGAAGAATT AGAATATGAA AAATATGAAA 
AAGAAAAGAA ACGATTGGAA AAAGCTATAA ATATAAAAGA ACAGAAAGCT 
CAACGAGCAA CTAAAAAACC GAAAAACTTA AGTTTATCTG AAGGCAAAAT 

25 AAAAGGAGCA AAGCCATACT TTGCAGGTAA GCAAAAGAAG TTACGAAAAA 
CTGTAAAATC TCTAGAAACC AGACTAGAAA AACTTGAAAG CGTCGAAAAG 
AGAAACGAAC TTCCTCCACT TAAAATGGAT TTAGTGAACT TAGAAAGTGT 
AAAAAATAGA ACTATAATAC GTGGTGAAGA TGTCTCGGGT ACAATTGAAG 
GACGGGTATT GTGGAAAGCA AAAAGTTTTA GTATTCGCGG AGGAGACAAG 

30 ATGGCAATTA TCGGATCTAA TGGTACAGGA AAGACAACGT TTATTAAAAA 
AATTGTGCAT GGGAATCCTG GTATTTCATT ATCGCCATCT GTCAAAATCG 
GTTATTTTAG CCAAAAAATA GATACATTAG AATTAGATAA GAGCATTTTA 
GAAAATGTTC AATCTTCTTC ACAACAAAAT GAAACTCTTA TTCGAACTAT 
TCTAGCTAGA ATGCATTTTT TTAGAGATGA TGTTTATAAA CCAATAAGTG 

35 TCTTAAGTGG TGGAGAGCGA GTTAAAGTAG CACTAACTAA AGTATTCTTA 
AGTGAAGTTA ATACGTTGGT ACTAGATGAA CCAACAAACT TTCTTGATAT 
GGAAGCTATA GAGGCGTTTG AATCTTTGTT AAAGGAATAT AATGGCAGTA 
TAATCTTTGT ATCTCACGAT CGTAAATTTA TCGAAAAAGT AGCCACTCGA 
ATAATGACAA TTGATAATAA AGAAATAAAA ATATTTGATG GCACATATGA 

40 ACAATTTAAA CAAGCTGAAA AGCCAACAAG GAATATTAAA GAAGATAAAA 
AACTTTTACT TGAGACAAAA ATTACAGAAG TACTCAGTCG ATTGAGTATT 
GAACCTTCGG AAGAATTAGA ACAAGAGTTT CAAAACTTAA TAAATGAAAA 
AAGAAATTTG GATAAATAA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1250 
1300 
1350 
1400 
1450 
1500 
1550 
1569 



45 



2) INFORMATION FOR SEQ 



ID NO: 1624 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1624 
TTCTTTAATG CTCGTAGATG AACCTA 



2) INFORMATION FOR SEQ ID NO: 1625 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1625 
TTTTCGTATT CTTCTTGTTG CTTTC 



2) INFORMATION FOR SEQ ID NO: 1626 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26' bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1626 
AGGAATGATT AAGCCCCCTT CAAAAA 



2) INFORMATION FOR SEQ ID NO: 1627 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1627 
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I 

TTACATTGCG ACCATGAAAT TGCTCT 26 



2) INFORMATION FOR SEQ ID NO: 1628 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1658 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Staphylococcus aureus 

(C) ACCESSION NUMBER: U82085 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1628 

20 ATGCTTAAAA TCGACATGAA GAATGTAAAA AAATATTATG CAGATAAATT 50 

AATTTTAAAT ATAAAAGAAC TAAAGATTTA TAGTGGGGAT AAAATAGGTA 100 

TTGTAGGTAA GAATGGAGTT GGCAAAACAA CACTTTTAAA AATAATAAAA 150 

GGACTAATAG AGATTGACGA AGGAAATATA ATTATAAGTG AAAAAACAAC 200 

TATTAAATAT ATCTCTCAAT TAGAAGAACC ACATAGTAAG ATAATTGATG 250 

25 GAAAATATGC TTCAATATTT CAAGTTGAAA ATAAGTGGAA TGACAATATG 300 

AGTGGTGGTG AAAAAACTAG ATTTAAACTA. GCAGAGGGAT TTCAAGATCA 350 

ATGTTCTTTA ATGCTCGTAG ATGAACCTAC AAGTAATTTA GATATCGAAG 400 

GAATAGAGTT GATAACAAAT ACTTTTAAAG AGTACCGTGA TACTTTTTTG 450 

GTAGTATCTC ATGATAGAAT TTTTTTAGAT CAAGTTTGTA CAAAAATTTT 500 

30 TGAAATTGAA AATGGATATA TTAGAGAATT CATCGGTAAT TATACAAACT 550 

ATATAGAGCA AAAAGAAATG CTTCTACGAA AGCAACAAGA AGAATACGAA 600 

AAGTATAATT CTAAAAGAAA GCAATTGGAG CAAGCTATAA AGCTAAAAGA 650 

GAATAAGGCG CAAGGAATGA TTAAGCCCCC TTCAAAAACA ATGGGAACAT 700 

CTGAATCTAG AATATGGAAA ATGCAACATG CTACTAAACA AAAAAAGATG 750 

35 CATAGAAATA CGAAATCGTT GGAAACACGA ATAGATAAAT TAAATCATGT 800 

AGAAAAAATA AAAGAGCTTC CTTCTATTAA AATGGATTTA CCTAATAGAG 850 

AGCAATTTCA TGGTCGCAAT GTAATTAGTT TAAAAAACTT ATCTATAAAA 900 

TTTAATAATC AATTTCTTTG GAGAGATGCT TCATTTGTCA TTAAAGGTGG 950 

AGAAAAGGTT GCTATAATTG GTAACAATGG TGTAGGAAAA ACAACATTGT 1000 

40 TGAAGCTGAT TCTAGAAAAA GTAGAATCAG TAATAATATC ACCATCAGTT 1050 

AAAATTGGAT ACGTCAGTCA AAACTTAGAT GTTCTACAAT CTCATAAATC 1100 

TATCTTAGAA AATGTTATGT CTACCTCCAT TCAAGATGAA ACAATAGCAA 1150 

GAATTGTTCT AGCAAGATTA CATTTTTATC GCAATGATGT TCATAAAGAA 1200 

ATAAATGTTT TGAGTGGTGG AGAACAAATA AAGGTTGCTT TTGCCAAGCT 1250 

45 ATTTGTTAGC GATTGTAATA CATTAATTCT TGATGAACCA ACAAACTATT 1300 

TGGATATCGA TGCTGTTGAG GCATTAGAAG AATTGTTAAT TACCTATGAA 1350 

GGTGTTGTGT TATTTGCTTC CCATGATAAA AAATTTATAC AAAACCTAGC 1400 

TGAACAATTG TTAATAATAG AAAATAATAA AGTGAAAAAA TTCGAAGGAA 1450 

CATATATAGA ATATTTAAAA ATTAAAGATA AACCAAAATT AAATACAAAT 1500 

50 GAAAAAGAAC TCAAAGAAAA AAAGATGATA CTAGAAATGC AAATTTCATC 1550 

ATTATTAAGT AAAATCTCAA TGGAAGAAAA TGAAGAAAAA AACAAAGAAT 1600 

TAGATGAAAA GTACAAATTG AAATTAAAAG AATTGAAAAG CCTAAATAAA 1650 

AATATTTA 1658 
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2) INFORMATION FOR SEQ ID NO: 1629 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1629 
AAGGGGAAAG TTTGGATTAC ACAACA 



2) INFORMATION FOR SEQ ID NO: 1630 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

.(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1630 
GAACCACAGG GCATTATCAG AACC 



2) INFORMATION FOR SEQ ID NO: 1631 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1631 
CGACGATGCT TTATGGTTTG T 



2) INFORMATION FOR SEQ ID NO: 1632 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1632 
GTTAATTTGC CTATCTTGTC ACACTC 26 

2) INFORMATION FOR SEQ ID NO: 1633 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 900 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

20 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: M36022 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1633 

ATGGAATTTA AATTACAAGA ATTAAATCTT ACTAACCAAG' ATACAGGACC 50 

ATATGGTATA ACCGTTTCAG ATAAGGGGAA AGTTTGGATT ACACAACATA 100 

AAGCAAATAT GATAAGTTGC ATCAATTTAG ATGGAAAAAT TACAGAGTAC 150 

30 CCACTACCGA CACCAGATGC AAAAGTCATG TGTTTAACTA TATCCTCAGA 200 

TGGGGAAGTT TGGTTTACTG AGAATGCAGC AAACAAAATA GGGAGGATTA 250 

CAAAAAAAGG GATTATTAAG GAATATACAT TGCCTAACCC AGATTCAGCA 300 

CCCTACGGTA TTACAGAAGG ACCAAATGGA GATATATGGT TTACAGAAAT 350 

GAATGGCAAC CGTATTGGAC GTATTACGGA CGACGGTAAA ATTCGTGAAT 400 

35 ACGAGCTGCC TAATAAAGGA TCTTACCCTT CTTTTATCAC TTTGGGTTCT 450 

GATAATGCCC TGTGGTTCAC AGAAAATCAA AATAATGCTA TTGGTAGAAT 500 

TACAGAAAGT GGGGATATTA CAGAGTTTAA AATTCCTACA CCTGCATCAG 550 

GACCAGTTGG TATTACAAAG GGGAACGACG ATGCTTTATG GTTTGTGGAA 600 

ATTATCGGTA ATAAGATAGG GCGAATAACT CCTCTGGGGG AAATTACCGA 650 

40 ATTCAAAATT CCAACGCCAA ACGCTCGACC TCATGCAATT ACTGCTGGAG 700 

CAGGAATTGA TTTATGGTTT ACTGAATGGG GGGCTAATAA AATAGGAAGG 750 

CTGACAAGCA ATAATATAAT TGAGGAATAC CCAATTCAAA TCAAAAGTGG 800 

TGAACCACAT GGCATTTGTT TCGATGGTGA AACAATTTGG TTTGCAATGG 850 

AGTGTGACAA GATAGGCAAA TTAACTCTCA TTAAGGATAA TATGGAGTGA 900 

45 



2) INFORMATION FOR SEQ ID NO: 1634 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1634 
TTAACTTGTC TATTCCCGAT TCAGG 



2) INFORMATION FOR SEQ ID NO: 1635 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1635 
GCTGTGGCAA TGGATATTCT GTA 



2) INFORMATION FOR SEQ ID NO: 1636 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1636 
TTCCTACCCC TGATGCTAAA GTGA 



2) INFORMATION FOR SEQ ID NO: 1637 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1637 

843 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 
CAAAGTGCGT TATCCGAACC TAA 



PCT/CA00/01150 



2) INFORMATION FOR SEQ ID NO: 1638 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 527 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: WSA-172 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1638 

GGTGGGGAAG ACTGTCTTCA TCCAGGAATT GATTGTGAGT CGTTCCACAT 50 

GCTCACCTAG TTTTCGCTCG ATCTTTTCAC TAACGCAAAC CATGTAGAAC 100 

AACATTGCCA AGGCCCACGG TGGTTACTCC GTCTTCACTG GTGTTGGTGA 150 

GCGTACTCGT GAGGGTAACG ATCTGTACCA CGAAATGCAG GAGACTGGTG 200 

TCATTCAGCT CGAGGGTGAA TCCAAGGTCG CACTGGTGTT CGGACAGATG 250 

AACGAGCCCC CCGGTGCCCG TGCCCGTGTC GCCCTTACCG GTCTGACCAT 300 

TGCCGAGTAC TTCCGTGACG AGGAGGGTCA GGACGTGCTG CTCTTCATTG 350 

ACAACATTTT CCGTTTCACC CAGGCCGGTT , CTGAGGTGTC TGCCCTTCTC 400 

GGTCGTATCC CCTCTGCCGT CGGTTACCAG CCCACCCTGG CCGTCGACAT 450 

GGGTGGTATG CAGGAGCGTA TCACCACCAC CAAGAAGGGT TCTATTACCT 500 

CCGTCCARGC CGTCTACGTC CCCGCGA 527 



2) INFORMATION FOR SEQ ID NO: 1639 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 452 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: ATCC 64746 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1639 

ACTGTCTTGA TTCAAGAACT TATTAACAAC ATTGCTAAAG CCCATGGTGG 50 
TTACTCTATT TTCTGTGGTG TTGGTGAACG TACTCGTGAA GGTAACGATT 100 
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TATACCACGA AATGATGGAA 
AAGTGTGCTC TTGTATTCGG 
CCGTGTTGCT TTAACTGGTT 
AAGGTCAAGA TGTGTTACTT 
GCTGGTTCTG AAGTATCTGC 
TTACCAACCC ACTTTATCTA 
CTACTACCAA GAATGGTTCC 
GC 



ACTGGTGTCA TTAAACTTGA AGGTGACTCC 
TCAAATGAAC GAACCTCCTG GTGCTCGTGC 
TAACCATTGC TGAATACTTC CGTGATGAAG 
TTCATTGATA ACATTTTCCG TTTCACTCAA 
CCTTTTAGGT CGTATTCCAT CTGCTGTAGG 
CTGATATGGG TGGTATGCAA GAACGTATTA 
ATTACCTCTG TACARGCCGT CTACGTCCCC 



150 
200 
250 
300 
350 
400 
450 
452 



10 



15 



20 



25 



30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1640 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 783 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

<ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus my co ides 

(B) STRAIN: ATCC 6462 

(C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1640 

ACGCCCTTAC GGTAAAACAA AGCAACGAAA ACGGAAGCAT GAACTTAACA 50 

TTTGAAGTTG CACTTCATTT AGGTGATGAT ACAGTTCGTA CAGTTGCGAT 100 

GTCTTCCACA GATGGACTTG TTCGTGGCAC AGAAGTAGAA GATACTGGTA 150 

AAGCAATCTC TGTACCAGTT GGTGATGCAA CACTTGGTCG TGTATTCAAC 200 

GTATTAGGTG ATGCAATTGA CTTAGATGGT GAACTTCCTG CGGATGTACA 250 

CCGTGATCCA ATTCACCGTC AAGCACCTGC ATTCGAAGAA TTATCTACTA 300 

AAGTAGAAAT TCTTGAAACT GGTATTAAAG TAGTAGACTT ACTTGCTCCT 350 

TACATTAAGG GTGGTAAGAT CGGCCTATTC GGTGGTGCCG GCGTAGGTAA 400 

AACAGTATTA ATTCAAGAAT TAATTAACAA CATCGCACAA GAGCACGGTG 450 

GTATCTCTGT ATTCGCTGGT GTAGGTGAGC GTACTCGTGA GGGTAATGAC 500 

TTATACCATG AAATGAGCGA TTCTGGCGTA ATCAAGAAAA CTGCGATGGT 550 

ATTCGGACAA ATGAATGAGC CACCTGGAGC ACGTCAACGT GTTGCATTAA 600 

CAGGTTTAAC AATGGCTGAG CATTTCCGTG ATGAGCAAGG ACAAGACGTA 650 

CTTCTGTTCA TCGATAACAT CTTCCGTTTC ACGCAAGCGG GTTCTGAAGT 700 

ATCTGCCCTT CTTGGTCGTA TGCCATCTGC GGTAGGTTAC CAACCAACAC 750 

TTGCAACAGA AATGGGTCAA TTACAAGAGC GTA 783 



2) INFORMATION FOR SEQ ID NO: 1641 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: NRRL NRS-319 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1641 



10 



ATCTACAACG CCCTTACGGT AAAACAAAGC AACGAAAACG GAGCAAGCAT 50 

TAACTTAACA TTTGAAGTTG CACTTCATTT AGGTGATGAC ACAGTTCGTA 100 

CAGTTGCAAT GTCTTCCACA GATGGACTTG TTCGTGGCAC AGAAGTAGAA 150 

GATACTGGTA AAGCAATCTC TGTACCAGTT GGTGATGCAA CACTTGGTCG 200 

15 TGTATTCAAC GTATTAGGTG ATGCAATTGA CTTAGATGGT GATGTTCCTG 250 

CGGATGTACG TCGTGATCCA ATTCACCGTC AAGCACCTGC ATTCGAAGAA 300 

CTATCTACTA AAGTAGAAAT TCTTGAAACT GGTATTAAAG TAGTAGACTT 350 

ACTTGCTCCT TACATTAAGG GTGGTAAGAT CGGTCTATTC GGTGGTGCCG 400 

GTGTAGGTAA AACAGTATTA ATTCAGGAAT TAATTAACAA CATCGCACAA 4 50 

20 GAACACGGTG GTATCTCTGT ATTCGCTGGT GTAGGTGAGC GTACTCGTGA 500 

GGGTAACGAC TTATACCATG AAATGAGCGA TTCTGGCGTA ATTAAGAAAA 550 

CTGCGATGGT ATTCGGACAA ATGAATGAGC CACCTGGAGC ACGTCAACGT 600 

GTTGCATTAA CAGGTTTAAC AATGGCTGAA CATTTCCGTG ATGAGCAAGG 650 

ACAAGACGTA CTATTGTTCA TCGATAACAT CTTCCGTTTC ACGCAAGCAG 700 

25 GTTCTGAAGT ATCTGCCCTT CTTGGTCGTA TGCCATCTGC GGTAGGTTAC 750 

CAACCAACAC TTGCAACAGA AATGGGTCAA TTACAAGAGC GTATTACATC 800 

TACAAATAAA GGATCTATCA CGT 823 



30 

2) INFORMATION FOR SEQ ID NO: 1642 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 829 bases 
35 <B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

40 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: NRRL BD-15 



45 (xi) SEQUENCE DESCRIPTION: SEQ 

GATGGCGGAA AGCTACCAGA AATCTACAAC 
CAACGAAAAC GGAGCAAACA TTAACTTAAC 
TAGGTGATGA TACAGTTCGT ACAGTTGCGA 
50 GTTCGTGGCA CAGAAGTAGA AGATACTGGT 
TGGTGATGTA ACACTTGGTC GTGTATTCAA 
ACTTAGATGG TGAAGTTCCT GCGGATGTAC 
CAAGCACCTG CATTCGAAGA ATTATCTACT 

84 



ID NO: 1642 

GCCCTTACGG TAAAACAAAG 50 

ATTTGAAGTT GCACTTCATT 100 

TGTCTTCCAC AGATGGACTT 150 

AAAGCAATCT CTGTACCAGT 200 

CGTATTAGGT GATGCAATTG 250 

GTCGTGATCC AATTCACCGT 300 
AAAGTAGAAA TTCTTGAAAC . 350 
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TGGTATTAAA GTAGTAGACT TACTTGCTCC 
TTGGTCTATT CGGTGGTGCC GGTGTAGGTA 
TTAATTAACA ACATCGCACA AGAACACGGT 
TGTAGGTGAG CGTACTCGTG AAGGTAACGA 
5 ATTCTGGCGT AATTAAGAAA ACTGCGATGG 
CCACCTGGAG CACGTCAACG TGTTGCATTA 
GCATTTCCGT GATGAGCAAG GACAAGACGT 
TCTTCCGTTT CACGCAAGCA GGTTCTGAAG 
ATGCCATCTG CGGTAGGTTA CCAGCCAACA 
10 ATTACAAGAG CGTATTACAT CTACAAATA 
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TTACATTAAG GGTGGTAAGA 400 

AAACAGTATT AATTCAGGAA 450 

GGTATCTCTG TATTCGCCGG 500 

CTTATACCAC GAAATGAGCG 550 

TATTCGGACA AATGAACGAG 600 

ACAGGTTTAA CAATGGCTGA 650 

ACTACTGTTC ATCGATAACA 700 

TATCTGCCCT TCTTGGTCGT 750 

CTTGCAACAG AAATGGGTCA 800 

829 



2) INFORMATION FOR SEQ ID NO: 1643 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 
<B) TYPE: Nucleic acid 
(C) STRANDEDNESS : Double 

2 0 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Bacillus pseudomycoides 

(B) STRAIN: NRRL BD-10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1643 

30 AATCTACAAT GCCCTTACGG TAAAACAAAG CAACGAAAAC GGAGCAAGCA 50 

TTAACTTAAC ATTTGAAGTT GCACTTCATT TAGGTGATGA CACAGTTCGT 100 

ACAGTTGCGA TGTCTTCCAC AGATGGACTT GTTCGTGGCA CAGAAGTAGA 150 

AGATACTGGT AAACCAATCT CTGTACCGGT TGGTGATGCA ACACTTGGTC 200 

GTGTATTTAA CGTATTAGGT GATGCAATTG ACTTAGATGG TGAAGTTCCT 250 

35 GCGGATGTAC GCCGTGATCC AATTCACCGT CAAGCACCTG CGTTCGAAGA 300 

GTTATCTACG AAAGTAGAAA TTCTTGAAAC TGGTATTAAA GTAGTAGACT 350 

TACTTGCTCC TTACATTAAA GGTGGTAAAA TCGGTCTATT CGGTGGTGCC 400 

GGTGTAGGTA AAACAGTATT AATCCAGGAA TTAATTAACA ACATCGCACA 4 50 

AGAGCACGGT GGTATTTCTG TATTCGCTGG TGTAGGTGAG CGTACTCGTG 500 

40 AAGGTAATGA CTTATACCAC GAAATGAGCG ATTCTGGCGT AATCAAGAAA 550 

ACAGCGATGG TATTCGGACA AATGAACGAG CCACCTGGTG CACGTCAACG 600 

TGTTGCATTA ACAGGATTAA CAATGGCTGA ACATTTCCGT GATGAGCAAG 650 

GACAAGACGT ACTATTGTTC ATCGATAACA TCTTCCGTTT CACGCAAGCG 700 

GGTTCTGAAG TATCTGCCCT TCTTGGTCGT ATGCCATCTG CGGTAGGTTA 750 

45 CCAACCAACT CTTGCAACAG AAATGGGTCA ATTACAAGAG CGTATTACAT 800 

CTACAAATAA AGGATCTATC ACG 823 



50 2) INFORMATION FOR SEQ ID NO: 1644 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 708 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

<ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus pseudomycoides 

(B) STRAIN: NRRL B-617 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1644 



GGAAAGCTAC CTGAAATCTA CAATGCCCTT AGAATTAAAC AAGACGCAGT 50 

TAACTTAACT TTAGAAGTTG CACTTCACTT AGGTGATGAT ACAGTTCGTA 100 

15 CAGTTGCGAT GTCTTCCACA GACGGACTTG TTCGTGGTAC TGCAGTAGAA 150 

GATACTGGCA AAGCGATTTC TGTTCCAGTT GGTGATGCAA CACTTGGTCG 200 

TGTATTTAAC GTATTAGGTG ATGCAATTGA CTTAGATGGT GAGGTTCCAG 250 

CAGATGTACG CCGTGATCCA ATTCACCGTC AAGCACCTGC ATTCGAAGAG 300 

TTATCTACAA AAGTAGAAAT TCTTGAAACT GGTATTAAAG TAGTAGACTT 350 

20 ACTTGCTCCT TACATTAAAG GTGGTAAAAT CGGTCTATTC GGTGGTGCCG 400 

GTGTAGGTAA AACAGTATTA ATTCAGGAAT TAATTAACAA CATCGCACAA 450 

GAGCACGGTG GTATCTCTGT ATTCGCTGGT GTAGGTGAGC GTACTCGTGA 500 

AGGTAACGAC TTATACCATG AAATGAGCGA TTCTGGCGTA ATCAAGAAAA 550 

CTGCGATGGT ATTCGGACAA ATGAACGAGC CACCTGGTGC ACGTCAACGT 600 

25 GTTGCATTAA CAGGTTTAAC AATGGCTGAA CATTTCCGTG ATGAGCAAGG 650 

ACAAGACGTA CTATTGTTCA TCGATAACAT CTTCCGTTTC ACGCAAGCGG 700 

GTTCTGAA 708 



2) INFORMATION FOR SEQ ID NO: 1645 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 778 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Budvicia aquatica 

(B) STRAIN: ATCC 35567 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1645 

GTACGATGCG CTTGAAGTTC AAAACGATGT CAAACTAGTG CTGGAAGTTC 50 

AACAGCAGCT CGGTGGTGGT GTCGTTCGTT GTATCGCTAT GGGTACCTCA 100 

GATGGCCTAA GCCGCGGTCT GAAAGTACTT GATTTAGAAC ACCCTATCGA 150 

50 AGTTCCTGTC GGTGTGGCAA CTCTGGGCCG TATTATGAAC GTGCTTGGTG 200 

AGCCTATCGA TATGAAGGGT GACATTGGTG AAGAAGAGCG TTGGGCTATC 250 

CATCGCGCTG CACCAAGCTA CGAAGATTTA TCCAGTGCTA ACGAACTGCT 300 

GGAAACGGGC ATCAAGGTTA TCGACCTGAT TTGTCCGTTT GCCAAAGGCG 350 
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GTAAAGTTGG TCTGTTTGGT GGTGCCGGCG TAGGTAAAAC GGTAAACATG" 400 

ATGGAGCTGA TTCGTAATAT TGCGACTGAG CACTCAGGTT ACTCTGTATT 450 

TGCCGGCGTT GGTGAGCGTA CTCGTGAGGG TAATGACTTC TACCACGAAA 500 

TGACAGAATC TAACGTATTA GACAAAGTAT CTCTGGTTTA TGGCCAAATG 550 

AATGAGCCAC CGGGAAACCG TCTGCGCGTT GCGTTAACCG GCCTGACCAT 600 

GGCGGAAAAA TTCCGTGATG AAGGCCGTGA CGTTCTGCTG TTTATCGATA 650 

ACATTTATCG TTATACCTTA GCCGGTACAG AAGTATCAGC ACTGTTAGGT 700 

CGTATGCCAT CAGCGGTAGG TTACCAACCA ACGCTGGCGG AAGAGATGGG 750 

TACACTGCAA GAACGTATCA CYTCAACC 778 



2) INFORMATION FOR SEQ ID NO: 1646 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Buttiauxella agrestis 
25 (B) STRAIN: ATCC 33320 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1646 

GATGCCGTAC CAAAAGTGTA CGACGCTCTT GAGGTTACAA ATGGTAAAGA 50 

30 CCGTCTGGTG CTGGAAGTTC AGCAACAGTT AGGTGGTGGC GTAGTGCGTA 100 

CTATCGCCAT GGGTACTTCT GATGGTTTGC GTCGTGGTCT GGAAGTTTCT 150 

AACCTCGATC ACCCAATTGA AGTGCCAGTA GGTAAAGCAA CTCTGGGCCG 200 

TATCATGAAC GTCCTGGGCG AGCCTATCGA CATGAAAGGC GATATCGGCG 250 

AAGAAGAGCG TTGGGCGATT CACCGTGCTG CTCCTAGCTA CGAAGAACTG 300 

35 TCTAGCTCCC AGGATCTGCT GGAAACCGGC ATCAAAGTAA TGGACCTGAT 350 

TTGCCCGTTC GCTAAGGGTG GTAAAGTCGG TCTGTTCGGT GGTGCGGGTG 400 

TGGGTAAAAC TGTAAACATG ATGGAGCTGA TCCGTAACAT CGCGATCGAG 450 

CACTCCGGTT ACTCAGTGTT TGCAGGCGTG GGGGAACGTA CTCGTGAGGG 500 

TAACGACTTC TACCATGAAA TGACCGATTC CAACGTTCTG GACAAAGTAT 550 

40 CACTGGTTTA TGGCCAGATG AACGAGCCAC CAGGTAACCG TCTGCGCGTT 600 

GCGTTGACCG GTCTGACCAT GGCTGAGAAG TTCCGTGACG AAGGTCGTGA 650 

CGTTCTGCTG TTCGTTGATA ACATTTACCG TTATACCCTG GCCGGTACAG 700 

AAGTATCTGC GCTGCTGGGT CGTATGCCAT CTGCGGTAGG TTACCAGCCA 750 

ACTCTGGCAG AAGAGATGGG TGTTTTGCAG GAGCGTATTA CCTCCACCAA 800 

45 AACTGG 806 



2) INFORMATION FOR SEQ ID NO: 1647 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1122 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida norvegica 

(B) STRAIN: ATCC 36586 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1647 



GAAACCGTTG TTGACACTGG TTCCCCAATC 
AACCTTGGGT CGTATCATTA ACGTTATCGG 
GTCCAATCAA CACCAAGCAA AGAAACCCAA 
TTTGCTGAAC AATCCACTTC TGCTGAAGTT 
TGTCGATTTA TTGGCTCCTT ATGCCAGAGG 
GTGGTGCCGG TGTCGGTAAA ACCGTCTTTA 
ATTGCTAAAG CTCATGGTGG TTTCTCAGTC 
AACCAGAGAA GGTAACGATT TATACCGTGA 
TTAACTTGGA AGGTGACTCT AAAGTCGCTT 
GAACCTCCAG GTGCTAGAGC CCGTGTTGCC 
TGAATACTTC AGAGATGAAG AAGGTCAAGA 
ACATTTTCAG ATTCACCCAA GCTGGTTCAG 
CGTATTCCAT CTGCTGTCGG TTATCAACCA 
TTTGTTGCAA GAACGTATCA CCACCACCAG 
TCCAAGCTGT TTATGTCCCA • GCTGATGATT 
ACCACTTTCG CGCATTTGGA TGC TACT ACT 
AGAATTGGGT ATCTACCCAG CTGTCGATCC 
TATTAGATGC CGCTGTTGTT GGTCAAGAAC 
GTTCAACAAA CTTTACAAGC CTACAAATCT 
TTTAGGTATG GATGAATTAT CAGAACAAGA 
CTAGAAAGAT CCAACGTTTC TTATCTCAAC 
TTCACTGGTA TTCCAGGTAG ATTAGTTAGA 
ATTCAGAGAC GTTTTAGCTG GT 
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1 H^/i I Lav- 1 ort 


1 I Lb I 1 bn 
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pTaTTa a apT 




TPPTaa a BTT 


p>/— rprprp7\rpmp/2 
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TCCAAGAATT 


GATTAACAAC 


300 






Ten 


AATGAAAGAA 


ACCGGTGTCA 


400 


TAGTCTTCGG 


TCAAATGAAC 


450 


TTAACCGGTC 


TAACCATTGC 


500 


TGTCTTGTTA 


TTCGTTGATA 


550 


AAGTCTCTGC 


CTTATTAGGT 


600 


ACCTTGGCCA 


CTGATATGGG 


650 


AAAAGGTTCC 


GTCACTTCTG 


700 


TGACTGATCC 


TGCCCCAGCC 


750 


GTCTTGTCTC 


GTGGTATCTC 


800 


ATTGGATTCC 


AAATCAAGAT 


850 


ATTACGATGT 


TGCTACTCAA 


: 900 


TTACAAGATA 


TTATCGCTAT 


950 


TAAATTGACC 


GTTGAAAGAG 


1000 


CATTTGCTGT 


CGCTGAAGTT 


1050 


TTGAAAGAAA 


CCGTTGCTTC 


1100 
1122 



2) INFORMATION FOR SEQ ID NO: 1648 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: ATCC 700677 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1648 
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TACGTCTCTT ACCCTGACGA TGATTTGCAA 
TGTTTCAAAT GGTAAAGTCA TCGCCCAACT 
GTAACGTTTC ATTTGGTACC AACCAAGCTG 
GGATCAACTA TGAAACCAAT CACAGACTAT 
5 TGTCTACGAT TCAACTGCTA CTATCGTTCA 
CTGGGACAGA TATCCCTCTC TATAACTGGG 
ATTACTCTGC AATATGCCCT TCAACAATCT 
AACACTAAAC AAGGTCGGTC TAGATAAGGC 
TTGGTATCGA CTATCCAAGC ATGCATTATG 

10 ACAACTGAAT CCAACAAAAA ATATGGTGCA 
CGCCTATGCC GCATTCGCAA ATGGTGGTAT 
TCAATAAAGT TGTCTTTAGC GATGGTAGCG 
GGCACAAGAG CCATGAAAGA AACGACTGCT 
GAAAACAGTC TGGACGTACG GAACTGGTCG 

15 TTCCTCAAGC TGGTAAAACA GGTACCTCTA 
GAAAAGTATA TCAAGAACAC TGGTTACGTA 
GGGTTATACC CGT 



PCT/CA00/01150 

GTCGCATCTA CGGTCGTAGA 50 

TGGAGCTCGT CACCAAGCAA 100 

TGGAAACCAA TCGTGACTGG 150 

GCTCCTGCCT TGGAGTACGG 200 

CGATGAGCCC TATAACTACC 250 

ATCGAGCATA TTTCGGTAAT 300 

CGTAACGTAC CTGCCGTTGA 350 

TAAAACCTTC CTTAATGGTC 400 

CAAACGCCAT TTCAAGTAAT 450 

AGTAGTGAAA AAATAGCTAC 500 

TTACCACAAA CCAATGTACA 550 

AAAAAGAATT TTCTGACCCT 600 

TACATGATGA CAGAAATGAT 650 

TGGTGCCTAC CTGCCTTGGC 700 

ACTATACTGA CGAAGAAATT 750 

GCTCCAGATG AAATGTTTGT 800 

813 



20 



2) INFORMATION FOR SEQ ID NO: 1649 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 813 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter lari 

(B) STRAIN : ATCC 43675 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1649 

ATTAATGAAG CTATTGTTGT TAATTATGAA TTAGAAGGAA AAGAATGCAA 50 

GCTAGTTCTT GAAGTAGCTG CACATTTAGG CGATAATAAA GTAAGAACCA 100 

TCGCTATGGA TATGACAGAT GGTCTTGTTA GAGGTTTAAC AGCCGTCGCA 150 

40 ACTGGAAATC CAATTAGTGT TCCAGTAGGC GAAAAAGTTC TTGGAAGAAT 200 

TTTTAATGTA ACGGGTGATT TGATTGATGA GGGCGAAGAA ATCAATTTTG 250 

ATAAGCACTG GTCAATTCAT AGAGATCCAC CTCCATTTGA AGAACAAAGT 300 

ACAAAAAGCG AAATCTTTGA AACAGGTATA AAGGTTGTTG ATTTGCTAGC 3 50 

TCCTTATGCT AAAGGTGGAA AAGTTGGTCT TTTTGGTGGT GCAGGTGTTG 400 

45 GTAAAACCGT TATTATTATG GAATTAATTC ACAATGTTGC ATTTAAACAT 450 

AGCGGATATT CTGTTTTTGC AGGTGTTGGC GAAAGAACTC GTGAGGGTAA 500 

TGACCTTTAC AATGAAATGA AAGAAAGTAA TGTATTAGAT AAAGTTGCAT 550 

TGTGTTATGG TCAAATGAAT GAACCACCAG GGGCAAGAAA TCGTATAGCT 600 

TTAACAGGTC TTACTATGGC TGAGTATTTT AGAGATGAAA TGGGACTTGA 650 

50 TGTTTTAATG ATTATTGATA ATATTTTCAG ATTTTCTCAA TCAGGCTCAG 700 

AAATGTCAGC ACTTTTAGGA AGAATTCCTT CAGCTGTTGG TTAT CAACCA 750 

ACCTTAGCTA GTGAAATGGG TAAGTTCCAA GAAAGAATTA CTTCAACCAA 800 

GAAAGGATCT ATT 813 
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2) INFORMATION FOR SEQ ID NO: 1650 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 570 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY:- Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidioides immitis 

(B) STRAIN: WSA-222 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1650 



GGTGGGGAAG ACTGTGTTCA TTCAGGAATT 
ACTAAAGCCG ACAGCGGTTG CTGATATGCT 
CTCACGGTGG TTACTCCGTG TTCACTGGTG 
GGTAACGATT TGTACCATGA AATGCAAGAG 
CGGAGAGTCC AAGGTCGCTC TTGTCTTCGG 
GTGCCCGTGC CCGTGTTGCC CTTACCGGTT 
CGTGACGAGG AAGGCCAAGA CGGTAGGCTT 
CGTGTGATAC AGGAGGCTAA TCGCTTTTCT 
ACATTTTCCG TTTCACTCAA GCTGGTTCTG 
CGTATTCCTT CCGCTGTCGG. TTACCAACCT 
TGTTATGCAG GAACGTATCA CCACCACCAC 
TGCARGCCGT CTACGTCCCC 



GATTGTAAGT TCTGTTATCA 50 

CTAGAACAAC ATTGCTAAGG 100 

TCGGTGAGCG TACCCGTGAG 150 

ACCCGTGTCA TTCAACTCGA 200 

TCAAATGAAC GAGCCCCCTG 250 

TGACCATTGC TGAATACTTC 300 

CATGCTTCTA TCGCTAGGGG 350 

AGTGCTTCTC TTTATTGACA 400 

AAGTGTCTGC CTTGCTCGGT 450 

ACTCTCGCCG TCGACATGGG 500 

CAAGGGATCC ATTACTTCAG 550 

570 



2) INFORMATION FOR SEQ ID NO: 1651 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Emmons ia parva 

(B) STRAIN: ATCC 10784 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1651 

TGTCTTCATC CAGGAGTTAA TTGTACGTTT TTTGTCTGCC TACTATAATA 50 

GACGAAGAAA TATTTTGATT TATTTTTCTA ATAATTCAAA TAGAACAACA 100 

TTGCCAAAGC CCACGGTGGT TACTCTGTCT TCACTGGTGT TGGCGAGCGG 150 

ACCCGTGAAG GAAACGATCT GTACCACGAA ATGCAGGAGA CCCGTGTCAT 200 

CCAGCTCGAT GGCGAGTCCA AGGTCGCCCT CGTCTTCGGT CAAATGAACG 250 

AACCCCCCGG AGCCCGTGCC CGTGTTGCCC TCACTGGTCT GACCGTTGCT 300 
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SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



GAATACTTCC GTGACGAGGA 
CCAGTAATTT GACTCGAAGC 
CATCGACAAC ATTTTCCGCT 
TGCTCGGCCG TATCCCCTCC 
GACATGGGTA TGATGCAGGA 
CACCTCCGTG 



AGGTCAAGAT GGTGCGTATA TATATATTCG 
TCCACTCACA CATATATTAG TGCTCCTCTT 
TCACCCAGGC AGGTTCCGAA GTGTCCGCCC 
GCCGTCGGTT ACCAGCCCAC CCTCGCTGTC 
ACGTATCACC ACCACCACCA AGGGCTCCAT 



350 
400 
450 
500 
550 
560 



2) INFORMATION FOR SEQ ID NO: 1652 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Erwinia amylovora 

(B) STRAIN: ATCC 14976 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1652 

CCACAAGTGT ACAGCGCCCT TGAGGTTAAA AATGGTGATG CTCGTCTGGT 50 

GCTGGAAGTT CAGCAGCAGC TGGGCGGTGG CGTGGTTCGT ACCATCGCCA 100 

TGGGTTCTTC AGACGGCCTT AAGCGTGGTC TGGAAGCCGT TGACCTTCAG 150 

CACCCAATTG AAGTACCGGT AGGTACTGCC ACACTTGGCC GTATCATGAA 200 

CGTGCTGGGT GAGCCGATCG ATATGAAAGG CGACATTGGC GAAGAAGAGC 250 

GCTGGGCGAT TCACCGCTCT GCACCTTCTT ACGAAGATCA GTCGAACTCT 300 

CAGGATCTGC TGGAAACCGG CATCAAGGTG ATTGACCTGA TGTGTCCGTT 350 

CGCCAAGGGC GGTAAAGTCG GCTTGTTCGG TGGTGCGGGC GTAGGTAAAA 400 

CCGTCAACAT GATGGAGCTT ATTCGTAACA TTGCGGCTGA GCACTCAGGT 4 50 

TTCTCG'GTAT TTGCCGGTGT GGGTGAGCGT ACCCGTGAAG GTAACGACTT 500 

CTACCACGAA ATGACCGACT CCAACGTTAT CGACAAAGTT TCGCTGGTCT 550 

ATGGTCAGAT GAACGAGCCA CCGGGTAACC GTCTGCGCGT TGCGCTGACC 600 

GGTCTGACCA TGGCGGAGAA GTTCCGTGAC GAAGGTCGTG ACGTACTGCT 650 

GTTTATCGAT AACATCTATC GTTACACCCT GGCCGGTACT GAAGTCTCCG 700 

CTCTGCTGGG TCGTATGCCT TCTGCGGTAG GTTACCAGCC AACGCTGGCG 750 

GAAGAGATGG GCGTTCTGCA GGAACGTATC 780 



2) INFORMATION FOR SEQ ID NO: 1653 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 545 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fonsecaea pedrosoi 

(B) STRAIN: ATCC 18831 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1653 



TGTGTTCATT CAGGAGCTGA TTGTGAGTAC 
TGCGCATGAA GCAAGCGCTG ACGTCCATCT 

10 CACGGTGGTT ACTCCGTGTT CTGCGGTGTC 
TAACGATTTG TACCACGAAA TGCAGGAGAC 
GCGAGTCCAA GGTCGCCCTT GTCTTCGGTC 
GCCCGTGCCC GTGTCGCCCT TACTGGTCTT 
GACAACCAGA AGCGAGTATT GCCACAATTA 

15 TTCCGTGACG AGGAGGGCCA GGATGTGCTT 
CCGTTTCACC CAGGCCGGTT CTGAGGTGTC 
CCTCTGCCGT CGGTTACCAG CCCACTCTCG 
CAGGAGCGTA TCACCACCAC CCAGAAGGGT 



CCCGGAGATT TTCCTGCGAT 50 

AGAACAACAT CGCCAAGGCT 100 

GGCGAGCGTA CTCGTGAGGG 150 

CGGTGTCATC AACCTCGAGG 200 

AGATGAACGA GCCCCCGGGA 250 

ACCGTCGCTG AGTAAGTTTT 300 

CTGACTAAAA ATCAAGATAT 350 

CTCTTCATTG ACAACATTTT 400 

CGCTCTTCTC GGCCGTATTC 450 

CCGTCGACAT GGGTATGATG 500 

TCCATCACTT CCGTC 545 



20 



35 



2) INFORMATION FOR SEQ ID NO: 1654 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 564 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium moniliforme 

(B) STRAIN: WSA-213 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1654 



TGTGTTTATT CAGGAGCTCA TCAACAACAT CGCCAAGGCC CACGGTGGTT 50 

ACTCTGTCTT CACTGGTGTC GGTGAGCGAA CCCGTGAGGG TAACGATCTG 100 

40 TACCACGAAA TGCAGGAGAC TTCCGTTATT CAGCTTGATG GCGAGTCCAA 150 

GGTTGCCCTG GTTTTCGGTC AGATGAACGA GCCCCCTGGA GCTCGTGCCC 200 

GTGTCGCTCT TACCGGGTAA GTTGATAGAT AGTGCCTTCC CTTCCTGTTT 250 

CAACACCTCA ACACCACCAC TCCCCAAGAC ATTGCTACTC ATACTGCACC 300 

ATGATATTAT ATTTACGCCT CTTGGACGCT AGCTAATGTT GTATCGACAG 350 

45 TTTGACTGTT GCTGAGTACT TCAGAGACGA GGAGGGTCAG GACGTGCTGC 400 

TTTTCATTGA CAACATTTTC CGATTCACTC AGGCCGGTTC CGAGGTGTCT 450 

GCCCTTCTCG GTCGTATCCC CTCTGCCGTC GGTTACCAGC CCACTCTGGC 500 

CGTCGACATG GGTGGTATGC AGGAGCGTAT TACCACCACC ACCAAGGGTT 550 

CCATTACCTC AGTC 564 

50 



2) INFORMATION FOR SEQ ID NO: 1655 

854 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 776 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 13182 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1655 



CGTACCGCGC GTGTACGAGG 
TGGTGCTGGA AGTTCAGCAG 
GCCATGGGTT CTTCCGACGG 
CGAGCATCCG ATCGAAGTCC 
TGAACGTACT GGGCCAACCG 
GAGCGTTGGG CGATTCACCG 
CTCTCAGGAA CTGCTGGAAA 
CGTTTGCGAA GGGCGGTAAA 
AAAACCGTAA ACATGATGGA 
CGGTTACTCC GTGTTTGCGG 
ACTTCTACCA CGAAATGACC 
GTGTATGGCC AGATGAACGA 
GACCGGCCTG ACCATGGCTG 
TGCTGTTCGT CGATAACATC 
TCCGCACTGC TGGGTCGTAT 
GGCGGAAGAG ATGGGCGTTC 



CTCTTGAGGT ACAAAATGGT 
CAGCTCGGCG GCGGTATTGT 
TCTGCGTCGC GGTCTGGAAG 
CGGTAGGTAA AGCAACGCTG 
GTAGACATGA AAGGCGACAT 
CGCAGCGCCT TCCTACGAAG 
CCGGCATCAA AGTTATCGAC 
GTTGGTCTGT TCGGTGGTGC 
GCTGATCCGT AACATCGCGA 
GCGTAGGTGA ACGTACTCGT 
GACTCCAACG TTATCGATAA 
GCCGCCGGGA AACCGTCTGC 
AGAAGTTCCG TGACGAAGGT 
TATCGTTACA CCCTGGCCGG 
GCCTTCAGCG GTAGGTTACC 
TGCAGG 



AGTGAGAATC 50 

TCGTACCATC 100 

TCAAAGACCT 150 

GGTCGTATCA 200 

CGGCGAAGAA 250 

AGTTGTCAAA 300 

CTGATGTGTC 350 

GGGTGTAGGT 400 

TCGAGCACTC 450 

GAGGGTAACG 500 

AGTATCCCTG 550 

GCGTTGCGCT 600 

CGTGACGTTC 650 

TACTGAAGTA 700 

AGCCGACTCT 750 

776 



2) INFORMATION FOR SEQ ID NO: 1656 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 572 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microsporum audouinii 

(B) STRAIN: ATCC 11347 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1656 

GTGTTCATCC AGGAGTTGAT TGTAAGTGAT TATATTCCCC TAGAAAGAAA 50 
TGTTTTGAAC AAAAGTCTCG AATTAGAAAA TTCTTTTCAG ATACTAATTT 100 
ACTATAGAAC AACATTGCCA AGGCTCACGG TGGTTACTCC GTCTTCACCG 150 
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GTGTCGGAGA GCGAACCCGT 
GAAACTCGTG TCATCCAACT 
CGGTCAGATG AACGAGCCCC 
GTTTGACCAT TGCTGAGTAC 
TTCTTTAAAT TAGATATCTT 
AGTGCTTCTC TTCATCGACA 
AAGTGTCTGC CCTGCTTGGT 
ACTCTTGCCG TCGACATGGG 
GAAGGGATCC ATTACCTCCG 



GAAGGAAACG ATCTGTACCA TGAAATGCAG 
TGATGGCGAG TCCAAGGTCG CCCTGGTCTT 
CAGGTGCCCG TGCCCGTGTT GCTCTTACTG 
TTCCGTGATG AGGAAGGTCA AGACGGTATG 
CTGGAGAAAC AGCGTCTAAC AAATTCTTCC 
ACATCTTCCG TTTCACTCAG GCTGGTTCCG 
CGTATTCCAT CTGCCGTCGG TTACCAACCC 
TGGTATGCAG GAACGTATTA CCACCACCAA 
TC 



200 
250 
300 
350 
400 
450 
500 
550 
572 



10 



15 



20 



25 



30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1657 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 790 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Obesumbacterium proteus 

(B) STRAIN: ATCC 12841 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1657 

GCCTAAAGTG TATAACGCAC TTGAGGTGAA AGGCGGTGCC ACTAAACTGG / 50 

TACTGGAAGT TCAGCAGCAG CTAGGCGGCG GCGTTGTACG CTGTATCGCT 100 

ATGGGTACTT CTGACGGTCT GCGTCGCGGA CTGGACGTTG TTGACCTGGA 150 

GCACCCGATT GAAGTCCCAG TAGGTAAAGC GACCTTAGGC CGCATTATGA 200 

ACGTACTGGG TGAGCCAATT GATATGAAGG GTGATATCGG CGAAGAAGAT 250 

CGCTGGGCTA TTCACCGTGA AGCTCCAAGC TACGAAGAAC TGTCTAACTC 300 

GCAAGAACTG CTGGAAACCG GTATCAAGGT AATGGACTTG ATTTGTCCGT 350 

TCGCTAAGGG CGGTAAAGTC GGTCTGTTCG GTGGTGCGGG TGTTGGTAAA 4 00 

ACAGTAAACA TGATGGAGCT GATCCGTAAC ATCGCGATCG AGCACTCAGG 4 50 

TTACTCTGTA TTTGCCGGCG TGGGTGAACG TACTCGTGAG GGTAACGACT 500 

TCTACCACGA AATGACCGAC TCCAACGTAT TGGACAAAGT ATCACTGGTT 550 

TATGGCCAGA TGAACGAGCC ACCAGGAAAC CGTCTGCGCG TTGCGCTGAC 600 

CGGTCTGACT ATGGCTGAGA AGTTCCGTGA CGAAGGTCGT GACGTACTGC 650 

TGTTCATCGA TAACATCTAC CGTTATACCT TGGCCGGTAC CGAAGTATCT 700 

GCACTGCTGG GTCGTATGCC TTCTGCGGTA GGTTATCAGC CAACGCTGGC 750 

GGAAGAGATG GGTGTTCTGC AAGAACGTAT CACCTCTACC 7 90 



2) INFORMATION FOR SEQ ID NO: 1658 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 622 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 

(B) STRAIN: ATCC 200443 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1658 

TGTCTTCATT CAGGAGCTTA TCAACAACAT CGCCAAGGCC CACGGTGGTT 50 

ACTCCGTCTT CACTGGTGTG GGAGAGCGCA CTCGTGAGGG AAACGACTTG 100 

TATCACGAGA TGCAAGAGAC TTCCGTTATC CAGCTCGAAG GCGAATCCAA 150 

GGTTGCCCTC GTCTTCGGTC AAATGAACGA GCCTCCGGGT GCTCGTGCTC 200 

GTGTTGCTCT CACCGGTCGT AAGTGCTCCT TCCCAGATTT CTCTTCCCCA 250 

GTTTCTGGAC CCACTTTTTC CTTCCACCAC CATTCTACTG GGTAGGACCA 300 

AGATAGCACT GCCTATTCTG GTGCCTTCCT ACCGCCTACT CTACTGCCTA 350 

TTCCACCACC TTTTCTACCG CCTCTTCTAC TTGCTATTGT ATACTAACTT 400 

ACTCAAACAG TTACTATTGC TGAGTACTTC CGTGACGCTG AGGGCCAGGA 4 50 

TGTGCTTCTC TTCATCGACA ACATTTTCCG TTTCACCCAG GCCGGTTCCG 500 

AGGTGTCCGC TCTTCTCGGT CGTATCCCCT CCGCCGTCGG TTACCAGCCC 550 

ACCCTTGCCG TCGACATGGG TGGTATGCAG GAGCGTATCA CCACCACCAA 600 

GAAGGGATCC ATTACCTCCG TC 622 



2) INFORMATION FOR SEQ ID NO: 1659 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 794 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plesiomonas shigelloides 

(B) STRAIN: ATCC 14029 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1659 

GACGCTGTAC CTCAGGTGTA CGATGCACTG ACAGTTGAGG GTGCTGAGCT 50 

GGTAC.TGGAA GTGCAGCAGC AGCTGGGTGG TGGTGTTGTT CGCTGTATCG 100 

CGATGGGTGC CTCTGATGGC CTCAAGCGCG GTCTGAAAGC GCACAATACT 150 

GGTGCTCCTA TCACTGTACC GGTGGGTGTG GAAACACTGG GCCGGATCAT 200 

GGATGTGTTG GGTAACCCGA TTGACCAGAA AGGTCCAATC GGTGAACAAG 250 

ATCGCTGGGT GATCCACCGT GAAGCACCAA GCTACGAAGA TCAGGCTAAC 300 

AGCACTGAAC TGCTGGAAAC CGGTATCAAG GTTATCGACC TGGTATGCCC 350 

GTTTGCGAAA GGCGGTAAAG TCGGTCTGTT CGGTGGTGCC GGTGTAGGTA 4 00 

AAACCGTAAA CATGATGGAG CTGATCCGTA ACATCGCGAT CGAGCACTCC 450 

GGTTATTCCG TGTTTGCGGG CGTGGGTGAG CGTACCCGTG AAGGTAACGA 500 

CTTCTACCAC GAAATGACAG ACTCCAACGT ACTGGACAAA GTATCCCTGG 550 
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TGTACGGTCA GATGAACGAG CCGCCAGGTA ACCGTCTGCG CGTAGCACTG 
ACCGGCCTGA CCATTGCGGA GAAATTCCGT GATGAAGGTC GTGACGTACT 
GCTGTTCATC GATAACATCT ACCGTTATAC CCTGGCGGGG ACCGAAGTAT 
CGGCACTGCT GGGCCGTATG CCTTCTGCGG TAGGTTATCA GCCAACGCTG 
GCGGAAGAGA TGGGTGTACT GCAAGAGCGT ATTACCTCTA CCCG 



600 
650 
700 
750 
794 
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2) INFORMATION FOR SEQ ID NO: 1660 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shewanella putrefaciens 

(B) STRAIN: ATCC 8071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1660 

AGGTATATGA CGCTSTGAAG ATCACAGGTG AAGGCGCCTG TAATGGTTTG 50 

GTGCTGGAAG TTCAGCAACA GCTAGGCGGT GGTGTAGTTC GTACTATCGC 100 

TATGGGTTCT TCTGATGGTC TGCGTCGTGG TCTTGAGGTT GTTAACTCAG 150 , 

GTTCACCTAT TTCTGTTCCT GTTGGTACCG CCACGCTTGG CCGTATCATG 200 

AACGTATTAG GTGAGCCTAT TGATGAAGCG GGTCCAATCG GTGAAGAAGA 250.. 

GCGTTATGTT ATTCACCGTG CAGCACCTTC ATATGAAGAT CAATCGAACA 300 

CTACTGAACT GTTAGAGACA GGTATCAAGG TTATTGACCT TGTTTGTCCA 350 

TTCGCTAAGG GTGGTAAAGT AGGTCTGTTC GGTGGTGCGG GTGTTGGTAA 400 

AACAGTTAAC ATGATGGAAC TGATTAACAA CATCGCTAAA GCTCACTCGG 450 

GTCTTTCGGT GTTCGCCGGT GTGGGTGAAC GTACTCGTGA AGGTAACGAC 500 

TTCTACTACG AGATGAAAGA TTCTGGCGTT CTCGACAAAG TGGCCATGGT 550 

TTATGGTCAG ATGAACGAGC CACCAGGAAA CCGTTTACGC GTAGCACTGT 600 

CAGGTCTGAC AATGGCTGAG AAGTTCCGTG ACGAAGGTCG TGACGTATTG 650 

TTGTTCGTTG ACAACATCTA CCGTTATACC TTAGCCGGTA CTGAAGTATC 700 

TGCACTGTTA GGCCGTATGC CTTCTGCGGT AGGTTATCAA CCAACATTGG 750 

CTGAAGAAAT GGGCGTTCTG CAAGAGCGTA TTACTTCAAC TAAGACGGG 799 



2) INFORMATION FOR SEQ ID NO: 1661 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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WO 01/23604 PCT/CA00/01150 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1661 

TGGGAAGCGA AAATCCTG 18 



2) INFORMATION FOR SEQ ID NO: 1662 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 774 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter curvus 

(B) STRAIN: ATCC 35224 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1662 



CTATGCCTCA AACTAGAGAG CATATCTTGC TATCTCGCCA AGTAGGCGTT 50 

CCATATATCG TTGTATTTAT GAACAAAGCC GATATGGTCG ATGACGCTGA 100 

25. GCTTCTTGAG CTAGTCGAGA TGGAAATTCG CGAGCTTCTT AACGAGTACA 150 

ACTTCCCTGG CGATGATACT CCTATCATAT CAGGTTCTGC TCTTAAAGCC 200 

CTCGAAGAGG CTAAAGCAGG CGTTGATGGC GAGTGGTCAG CAAAAGTTCT 250 

TGAGCTTATG GATAAAGTCG ATGAGTATAT CCCAACTCCA GTTCGTGCTA 300 

CCGATAAAGA CTTCCTGATG CCTATCGAAG ACGTTTTCTC TATCTCAGGT 350 

10 CGTGGAACGG TCGTTACTGG TAGGATCGAA AAAGGTGTCG TAAAAGTTGG 4 00 

CGATACTATC GAGATCGTTG GTATCAAACC TACTCAAACT ACGACAGTTA 450 

CTGGCGTTGA GATGTTTAGG AAAGAGATGG AACAAGGCGA GGCCGGTGAT 500 

AACGTAGGTG TTCTTTTAAG AGGTACTAAA AAAGAAGACG TCGAGCGCGG 550 

CATGGTTCTT TGTAAGCCAA AATCAATCAC TCCTCATACA AAATTTGAGG 600 

35 GTGAGGTTTA TATCCTAACA AAAGAGGAAG GCGGACGCCA CACTCCATTC 650 

TTTAACAACT ATAGACCACA ATTTTATGTA AGAACAACAG ACGTTACAGG 700 

TTCTATCACA CTTCCAGAAG GAACTGAGAT GGTTATGCCT GGAGATAATG 750 

TCAGAATTTC CGTTGAACTC ATCG 77 4 



2) INFORMATION FOR SEQ ID NO: 1663 



(i) SEQUENCE CHARACTERISTICS: 
IS (A) LENGTH: 7 91 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter rectus 
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(B) STRAIN: ATCC 33238 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1663 

5 TTCTGCGGCT GACGGCCCAA TGCCACAAAC TAGAGAGCAC ATCTTGCTAT 50 

CTCGCCAAGT AGGCGTTCCG TATATCGTTG TTTTTATGAA CAAAGCCGAT 100 

ATGGTCGATG ATGCCGAGCT TCTTGAGCTG GTTGAGATGG AGATTCGCGA 150 

GCTTCTAAAC GAGTATGATT TCCCTGGTGA CGATACTCCA ATCGTAGCAG 200 

GCTCTGCTCT TCAAGCTCTT AATGAAGCCA AAGCCGGAAC AGAAGGCGAG 250 

10 TGGTCTGCAA AAATTCTTGA GCTTATGGCT AAAGTTGACG AGTATATCCC 300 

GACTCCGGTT CGTGCAACGG ATAAAGACTT CTTGATGCCT ATTGAGGACG 350 

TTTTCTCTAT CTCCGGTCGC GGCACCGTCG TTACCGGCAG AATCGAAAAA 400 

GGTATCGTAA AAGTCGGTGA TACTATCGAG ATCGTAGGTA TCCGCGATAC 4 50 

TCAAACAACT ACCGTTACCG GCGTTGAGAT GTTCAGAAAA GAGATGGATC 500 

15 AAGGCGAAGC GGGCGATAAC GTAGGCGTTC TTCTAAGAGG CACTAAAAAA 550 

GAAGACGTTG AGCGCGGTAT GGTTCTTTGC AAACCTAAAT CAATCACTCC 600 

TCACACTAAA TTTGAGGGAG AGGTTTATAT CTTAACTAAA GAGGAAGGCG 650 

GACGCCATAC TCCATTCTTT AATAACTATA GACCGCAGTT TTATGTAAGA 7 00 

ACTACCGACG TTACCGGTTC TATCACTCTT CCGGAAGGAA CAGAGATGGT 7 50 

20 TATGCCTGGC GATAACTTAA AGATAAGCGT TGAGCTTATC G 7 91 



2) INFORMATION FOR SEQ ID NO: 1664 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Fonsecaea pedrosoi 

(B) STRAIN: ATCC 18831 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1664 

40 CGACGGACAG ATGCCCCAGA CCAGGGAGCA CTTGCTCCTC GCCCGCCAGG 50 

TCGGTGTCAA GCGCATTGTC GTCTTCGTCA ACAAGGTCGA TGCCATTGAG 100 

GACAAGGAGA TGTTGGAGCT CGTCGAGATG GAGATGCGTG AGCTTCTCTC 150 

CAGCTACGGC TTCGAGGGTG ACGACACTCC CATCGTCATG GGTTCCGCCC 200 

TTTGCGCCAT TGAGGGCCGC GAGCCCGACA TTGGTGTCGA GAAGATTGAC 250 

45 GAGCTCCTCG AGCACGTCGA CACCTGGATC CCCACCCCCG AGCGTGACAT 300 

CGCCAAGCCT TTCCTCATGT CCGTTGAGGA CGTCTTCTCC ATTCCCGGCC 350 

GTGGTACCGT CGCTTCTGGC CGTGTCGAGC GTGGTGTCCT GAAGAAGGAT 400 

TCCGAAGTCG AGCTTGTCGG CAAGAACAAG AACCCCATCA AGACCAAGGT 4 50 

TACCGACATC GAGACCTTCA AGAAGTCTTG CGACGAGTCC CGCGCTGGTG 500 

50 ACAACTCCGG TCTCCTTCTC CGTGGTGTCA AGCGTGACGA TGTCCTCCGT 550 

GGCATGGTCG TTGTCCAGCC CGGCACCACC AAGGCCCACA AGAAGTTCCT 600 

TGCCTCCATG TACGTCCTCA CCAAGGAGGA GGGTGGCCGC CACACTGGTT 650 

TCGCCAACAA CTACAAGCCC CAGATGTTCA TCCGTACCGC CGATGAGGCC 700 
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GAGGACAAGA TGGTCATGCC 750 
CCACAAGCCC ATTGCCGTCG 800 

810 



2) INFORMATION FOR SEQ ID NO: 1665 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1665 
CAGTACAGGT AGACTTCTG 



2) INFORMATION FOR SEQ ID NO: 1666 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Microsporum audouinii 

(B) STRAIN: ATCC 11347 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1666 

ATGATTGCGA AACCTACTTG CTGTGGAAGA ATTTGGATAT TCTAACATTT 50 

CTCTAGGCCT CAAACCAGAG AGCATCTGCT CCTTGCCCGC CAGGTCGGTG 100 

TTCAGAAGCT CGTCGTTTTC GTTAACAAGG TCGACGCTGT TGAGGACCCA 150 

GAGATGTTGG AACTTGTCGA GCTAGAGATG CGTGAGCTGC TCAGCCACTA 200 

TGGTTTCGAG GGTGAGGAGA CCCCAATCAT TTTTGGCTCT GCTCTCTGCG 250 

CCCTTGAATC TCGACGACCA GAATTGGGTG TTGAGAAGAT CGATGAGCTA 300 

TTGAACGCTG TGGATACCTG GATTCCCACC CCAGAGCGTG CCACTGATAA 350 

GCCTTTCCTT ATGTCCATTG AGGAAGTTTT CTCCATCTCT GGTCGTGGTA 400 

CCGTCGTTTC CGGTCGTGTC GAGCGTGGTA TCCTCAAGAA GGACTCTGAT 4 50 

GTCGAAATTG TGGGTGGATC TGATACACCC ATCAAGACGA AGGTCACCGA 500 

CATTGAAACC TTCAAGAAGT CTTGTGACGA ATCCCGAGCT GGTGACAACT 550 

CCGGTCTACT TCTCCGAGGT GTCAAGCGTG AGGACTTGAG ACGTGGAATG 600 

GTTGTTGCTG CTCCCGGATC GACCAAGGCT CATACCGACT TCATGGTCTC 650 

CCTTTATGTT CTGACCGAGG CTGAGGGTGG CCGTTCCAAT GGATTCACCC 700 

ACAAGTACCG CCCACAGATG TTCATCCGTA CTGCCGGTAT GTAAACCCTT 750 
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TTTCTACCAT TCACTTTGTT TCACCACTGA CTTGTATACT TTACCGCAGA 
CGAAGCCGCA TCTTTCAGCT GGCCTGGAGA GGATCAAGAC AAGAAGGCCA 
TGCCTGGTGA CAATGTCGAG ATGATTTGCA AGACCCTC 
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2) INFORMATION FOR SEQ ID NO: 1667 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 793 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Piedraia hortai 

(B) STRAIN: ATCC 24292 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1667 

ATGCCGCAGA CCCGCGAGCA CTTGCTGCTC GCCCGTCAGG TCGGTGTCCA 50 

GAAGATCGTT GTTTTCGTCA ACAAGGTTGA TGCTATCGAC GACCCGGAGA 100 

TGCTGGAGCT TGTCGAGATG GAGATGCGTG AACTTCTCAG CACATACGGT 150 

TTCGAGGGTG ACGAGACCCC TGTTATTATG GGCTCCGCGC TCATGGCTCT 200 

CAACAACCAG CGCCCCGAGA TTGGTCAACA GAAGATTGAT GAACTCATGG 250 

CCGCTGTCGA CGAGTGGATC CCTACTCCCC AGCGTGACCT CGACAAGCCT 300 

TTCCTGATGT CTGTTGAGGA ;TGTCTTCTCC ATTGCTGGCC GTGGTACCGT 350 

TGTGTCCGGC CGTGTGGAGC GCGGTACCCT CAAGCGTGAT GAGGAAGTCG 4 00 

AGCTTGTCGG CAAGGGTGTC GACCCCATCA AGACCAAGGT CACCGATATC 450 

GAGACTTTCA AGAAGTCCTG CGAGGAGGCT CAGGCTGGTG ACAACTCTGG 500 

TCTTCTGATC CGTGGTGTCC GCCGCGAGGA TGTTCGTCGC GGTATGGTTG 550 

TCTCCAAGCC CGGCACCGTC AAGTCTCACA CTCAGTTCCT GGCCTCGCTT 600 

TACGTTCTCA CCAAGGAGGA GGGTGGTCGC CACACTGGTT TCGGCGAGCA 650 

CTACCGTCCC CAGCTCTACC TCCGTACCTC AGACGAGTCT GTCGATCTGA 700 

CCTTCCCCGA GGGAACTGAG GATCACCACT CCAAGATCGT CATGCCTGGT 7 50 

GACAACATCG AGATGGTCGT CACGATGACT CACGCCAACG CTA 793 



2) INFORMATION FOR SEQ ID NO: 1668 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
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(B) STRAIN: K-12 KL1699 

(C) ACCESSION NUMBER: J01717 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1668 

AACATGATCA CCGGTGCTGC GCAGATGGAC GGCGCGATCC TGGTAGTTGC 50 

TGCGACTGAC GGCCCGATGC CGCAGACTCG TGAGCACATC CTGCTGGGTC 100 

GTCAGGTAGG CGTTCCGTAC ATCATCGTGT TCCTGAACAA ATGCGACATG 150 

GTTGATGACG AAGAGCTGCT GGAACTGGTT GAAATGGAAG TTCGTGAACT 200 

TCTGTCTCAG TACGACTTCC CGGGCGACGA CACTCCGATC GTTCGTGGTT 250 

CTGCTCTGAA AGCGCTGGAA GGCGACGCAG AGTGGGAAGC GAAAATCCTG 300 

GAACTGGCTG GCTTCCTGGA TTCTTACATT CCGGAACCAG AGCGTGCGAT 350 

TGACAAGCCG TTCCTGCTGC CGATCGAAGA CGTATTCTCC ATCTCCGGTC 400 

GTGGTACCGT TGTTACCGGT CGTGTAGAAC GCGGTATCAT CAAAGTTGGT 450 

GAAGAAGTTG AAATCGTTGG TATCAAAGAG ACTCAGAAGT CTACCTGTAC 500 

TGGCGTTGAA ATGTTCCGCA AACTGCTGGA CGAAGGCCGT GCTGGTGAGA 550 

ACGTAGGTGT TCTGCTGCGT GGTATCAAAC GTGAAGAAAT CGAACGTGGT 600 

CAGGTACTGG CTAAGCCGGG CACCATCAAG CCGCACACCA AGTTCGAATC 650 

TGAAGTGTAC ATTCTGTCCA AAGATGAAGG CGGCCGTCAT ACTCCGTTCT 700 

TCAAAGGCTA CCGTCCGCAG TTCTACTTCC GTACTACTGA CGTGACTGGT 750 

ACCATCGAAC TGCCGGAAGG CGTAGAGATG GTAATGCCGG GCGACAACAT 800 

CAAAATGGTT GTTACCCTGA TCCACCCGAT CGCGATGGAC GACGGTCTGC 850 

GTTTCGCAAT CCGTGAAGGC GGCCGTACCG TTGGCGCGGG C 891 



2) INFORMATION FOR SEQ ID NO: 1669 

(i) SEQUENCE CHARACTERISTICS': 

(A) LENGTH: 805 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saksenaea vasiformis 

(B) STRAIN: ATCC 60625 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1669 

TCCTCGTGGT CGCCGCCACC GACGGCCCGA TGCCGCAGAC CAAGGAGCAC 50 

GTGCTCCTGG CCCGCCAGGT CGGCGTTCCG TACATCGTCG TCGCCCTCAA 100 

CAAGGCCGAC ATGGTGGACG ACGAGGAGAT CCTGGAGCTC GTCGAGCTCG 150 

AGGTCCGTGA GCTCCTCTCC GAGTACGAGT TCCCGGGCGA CGACGTTCCC 200 

GTCGTCAAGG TCTCCGCTCT GAAGGCCCTC GAGGGCGACA AGGAGTGGGG 250 

CAACTCGGTT CTCGAGCTCA TGAACGCCGT CGACACGGCG ATCCCCGAGC 300 

CCGAGCGTGA CGTCGACAAG CCGTTCCTCA TGCCGATCGA GGACGTCTTC 350 

ACGATCACCG GTCGCGGTAC GGTCGTCACC GGCCGTATCG AGCGTGGTGT 400 

CCTGAAGGTC AACGAGACCG TCGACATCAT CGGCATCAAG ACCGAGAAGA 450 

CCACCACCAC GGTCACCGGC ATCGAGATGT TCCGGAAGCT CCTCGACGAG 500 

GGCCAGGCCG GTGAGAACGT CGGTCTGCTC CTCCGTGGCA TCAAGCGCGA 550 
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GGACGTCGAG CGCGGCCAGG TCATCATCAA GCCGGGCTCG GTCACGCCGC 
ACACGGAGTT CGAGGCGCAG GCCTACATCC TGTCCAAGGA CGAGGGTGGC 
CGCCACACGC CGTTCTTCAA CAACTACCGC CCGCAGTTCT ACTTCCGTAC 
GACGGACGTG ACCGGCGTGG TGACCCTCCC CGAGGGCACC GAGATGGTCA 
TGCCGGGTGA CAACACCGAG ATGAAGGTGG AGCTCATCCA GCCCGTCGCC 
ATGGA 
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2) INFORMATION FOR SEQ ID NO: 1670 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 935 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trichophyton tonsurans 

(B) STRAIN: ATCC 56185 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1670 

GCTTCTGACG GTCAAATGTA ATTGAATGCC CGCCCAGACA GATGAAAGGA 50 

TTTGACGTTT CTAACATCAG TCTAGGCCTC AGACCAGAGA ACATTTGCTC 100 

CTTGCCCGCC AGGTCGGTGT CCAGAAGCTG GTCGTTTTCG TTAACAAGGT 150 

CGATGCCGTT GAGGACCCAG AGATGTTGGA GCTTGTCGAA CTTGAAATGC 200 

GTGAACTCCT CAGCCACTAC GGTTTCGAGG GTGAGGAGAC CCCCATCATT 250 

TTTGGCTCTG CTCTCTGTGC CCTCGAGTCC CGTCGACCTG AGCTTGGTGT 300 

CGAGAAGATT GACGAGCTAT TGAACGCCGT CGACACCTGG ATCCCCACCC 350 r 

CAGAGCGCGC CACTGATAAG CCTTTCCTCA TGTCCATTGA GGAAGTGTTC 400 

TCTATCTCTG GTCGTGGTAC CGTCGTCTCC GGTCGTGTTG AGCGTGGTAT 450 

CCTCAAGAAG GATTCGGACG TCGAAATTGT TGGTGGCTCT ACCACCCCTA 500 

TCAAGACCAA GGTCACCGAT ATCGAAACCT TCAAGAAGTC CTGCGATGAA 550 

TCTCGAGCTG GTGACAACTC TGGTCTCCTT CTCCGAGGTA TCAAGCGTGA 600 

GGACTTGAAG CGTGGAATGG TTGTTGCTGC CCCCGGATCC ACCAAGGCTC 650 

ACACCGACTT CATGGTCTCC CTCTACGTCC TGACTGAGGC TGAGGGTGGT 700 

CGTTCCAACG GCTTCACCCA CAAGTACCGC CCCCAAATGT TCATCCGTAC 750 

TGCTGGTATG TAACCCAAGT TTCCGCTATT TACTAAGTAG ATCATTGCTA 800 

ACTTGTATTT CCTTCCGTAG ACGAAGCCGC ATCTTTCAGC TGGCCTGGAG 850 

AAGACCAAGA CAAGAAGGCT ATGCCTGGTG ACAACGTCGA GATGATTTGC 900 

AAGACCCTCC ACCCCATTGC TGCCGAGGCT GGCCA 935 



2) INFORMATION FOR SEQ ID NO: 1671 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 772 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: En terobacter aerogenes 

(B) STRAIN: ATCC 13048 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1671 



ACGATGCCCT TGAGGTACAG AATGGTAATG AGAGCCTGGT GCTGGAAGTT 50 

CAGCAGCAGC TCGGCGGTGG CGTAGTCCGT GCTATCGCCA TGGGTTCTTC 100 

CGACGGTCTG CGTCGTGGTC TGGAAGTTAA AGACCTTGAG CACCCGATCG 150 

AAGTCCCGGT AGGTAAAGCG ACTCTGGGCC GTATCATGAA CGTCCTGGGT 200 

15 CAGCCGATCG ACATGAAAGG CGACATCGGC GAAGAAGAAC GTTGGGCTAT 250 

CCACCGCGCG GCGCCTTCCT ATGAAGAGCT GTCCAGCTCT CAGGAACTGC 300 

TGGAAACCGG CATCAAAGTT ATCGACTTGA TGTGTCCGTT CGCTAAGGGC 350 

GGTAAAGTTG GTCTGTTCGG TGGTGCGGGT GTAGGTAAAA CCGTAAACAT 400 

GATGGAGCTG ATCCGTAACA TCGCGATCGA GCACTCCGGT TACTCCGTGT 4 50 

2 0 TTGCGGGCGT TGGTGAGCGT ACTCGTGAGG GTAACGACTT CTATCACGAA 500 

ATGACCGACT CCAACGTTCT GGATAAAGTA TCCCTGGTTT ACGGCCAGAT 550 

GAACGAGCCG CCGGGAAACC GTCTGCGCGT TGCGCTGACC GGCCTGACCA 600 

TGGCTGAGAA ATTCCGTGAC GAAGGTCGTG ACGTTCTGCT GTTCGTCGAT 650 

AACATCTATC GTTACACCCT GGCCGGTACT GAAGTATCTG CACTGCTGGG 700 

25 CCGTATGCCT TCAGCGGTAG GTTATCAGCC GACTCTGGCG GAAGAGATGG 750 

GCGTTCTGCA GGAACGTATC .AC 772 



30 2) INFORMATION FOR SEQ ID NO: 1672 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1401 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bordetella pertussis 

(B) STRAIN: Tohama 1 

(C) ACCESSION NUMBER: Genome project 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1672 

ATGAGCAACG GAACCATCGT TCAGTGCATC GGCGCCGTGG TGGATATTCA 50 

GTTCCCCCGC GATAACATGC CCAAGATCTA CGAAGCGCTC ACCCTGGTCG 100 

ACGAGGGTTC CTCGTTCGCC GAGAAGGGCT TGACGCTGGA AGTGCAACAA 150 

50 CAGCTGGGCG ACGGCGTGGT GCGTACCATC GCGCTGGGTT CCAGCGACGG 200 

GCTGCGCCGC GGCATGCAAG TGGCCGGCAC CGGCGCACCG ATCTCGGTGC 250 

CCGTGGGCCA CGGCACCCTG GGCCGCATCA TGGACGTGCT GGGCCGTCCC 3 00 

ATCGACGAAG CCGGTCCCAT CGCCTCCGAC GAGAAGCGCG CCATCCACCA 350 
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GCCCGCGCCC CGTTTCGACG AGCTGTCGCC GTCGGTCGAG CTGCTGGAAA 4 00 

CCGGCATCAA GGTTATCGAC CTGGTGTGCC CGTTCGCCAA GGGCGGCAAG 450 

GTCGGCCTGT TCGGCGGCGC CGGCGTGGGC AAGACCGTCA ACATGATGGA 500 

ACTGATCAAC AACATCGCCA AGCAGCACAG CGGCTTGTCG GTGTTCGCCG 550 

5 GCGTGGGCGA GCGTACCCGC GAAGGCAACG ACTTCTACCA CGAAATGGAA 600 

GAGTCGAACG TTCTGGACAA GGTGGCCATG GTGTTCGGCC AGATGAACGA 650 

GCCCCCGGGC AACCGCCTGC . GCGTGGCGCT GACCGGCCTG ACCATGGCCG 700 

AGAAGTTCCG CGACGAAGGC CGTGACATCC TGTTCTTCGT CGACAACATC 750 

TACCGCTACA CCCTGGCCGG TACCGAAGTG TCGGCGCTGC TGGGCCGTAT 800 

10 GCCGTCGGCG GTGGGCTACC AGCCTACGCT GGCCGAGGAA ATGGGCGTGC 850 

TGCAAGAGCG CATCACCTCG ACCAAGACCG GTTCGATCAC CTCGATCCAG 900 

GCCGTGTACG TGCCTGCCGA CGACTTGACC GACCCGTCGC CCGCCACGAC 950 

CTTCCAGCAC TTGGACTCGA CCGTCGTGCT GTCGCGTGAC ATCGCTGCGC 1000 

TGGGCATCTA TCCCGCCGTG GACCCGCTGG ATTCCTCCAG CCGCCAGCTC 1050 

15 GACCCGCAAG TCGTGGGCGA AGAGCACTAC CAGGTGGCCC GTGGCGTGCA 1100 

GCAGACGCTG CAGCGCTACA AGGAACTGCG CGACATCATC GCGATTCTGG 1150 

GCATGGACGA ACTGTCGCCG GAAGACAAGC AGGCCGTGGC CCGCGCGCGC 12 00 

AAGATCCAGC GCTTCCTGTC GCAGCCCTTC TACGTGGCCG AAGTGTTCAC 1250 

CGGCTCGCCG GGCAAGTACG TGTCGCTGGC CGAAACGATC CGTGGCTTCA 13 00 

20 AGATGATCGT CGACGGCGAG TGCGACGCGC TGCCCGAGCA GGCGTTCTAC 1350 

ATGGTCGGCA CGATCGACGA GGCCTTCGAG AAGGCCAAGA AACTCCAATA 1400 

A 1401 



25 

2) INFORMATION FOR SEQ ID NO: 1673 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 97 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

MOLECULE TYPE:* Genomic DNA 
ORIGINAL SOURCE: 

(A) ORGANISM: Arcanobacterium haemolyticum 

(B) STRAIN: ATCC 9345 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1673 

CAGCTACCGA CGGTCCAATG GCTCAGACCC GCGAGCACGT TCTTCTTGCT 50 

CGCCAGGTTG GCGTTCCACA GATCATCGTT GCTCTCAACA AGGCTGACAT 100 

GGTTGACGAC GAGGAAATCC TCGAACTCGT CGAAATGGAA GTTCGTGAGC 150 

45 TTCTCTCTTC CCAGGAGTAC CCAGGTGACG ACCTCCCAGT CGTCAAGATC 2 00 

TCGGCACTCA AGGCTCTCGA AGGCGATGCC GAATGGAGCA AGGCAATCGA 250 

AGATCTCATG GAAGCTGTCG ATACCTACTT CGACGATCCA GTGCGTGACC 3 00 

TCGATAAGCC ATTCCTCATG CCAATCGAAG ACGTCTTCAC CATCACCGGT 3 50 

CGTGGCACCG TTGTTACCGG CCGTGCAGAG CGCGGTATGC TCAACTTGAA 400 

50 CGAAGAAGTT GAAATCCTCG GTATCCGTGC ACCACAGAAG ACAACCGTTA 450 

CCGGTATCGA AATGTTCCAC AAGTCGATGG ATCACGCAGA TGCAGGCGAA 500 

AACTGTGGTC TTCTCCTCCG TGGCACCAAG CGCGAAGATG TTGAACGTGG 550 

TCAGGTTGTT GCCAAGCCAG GCACCATCAC CCCACACACC AACTTCGAAG 600 

866 
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CTCAGGTCTA CGTGCTCGGT AAGGAAGAAG GTGGCCGTCA CAACCCATTC 
TTCTCCAACT ACCGTCCACA GTTCTACTTC CGTACCACGG ATGTTACCGG 
CGTGATCACC CTTCCAGAGG GCACCGAAAT GGTTATGCCA GGCGACAACA 
CCGACATGAC AGTTGAGCTC ATCCAGCCAA TCGCTATGGA AGAGGGC 
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2) INFORMATION FOR SEQ ID NO: 1674 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 785 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Butyrivibrio fibrisolvens 

(B) STRAIN: ATCC 19171 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1674 

CTGATGGTCC TATGCCACAG ACCCGTGAGC ACATCCTATT AGCACGTCAG 50 

GTAGGCGTAC CATACATCAT CGTATTCCTA AACAAGTGCG ATATGGTTGA 100 

CGACGAGGAA TTATTAGAGT TAGTTGAGAT GGACGTACGT GATCTATTAA 150 

ATCAGTACCA GTTCCCAGGC GACGACACTC CAATCATCCG- TGGTTCAGCA 200 

CTAGGTGCAT TAAACGGCGA AGAGAAGTGG AAAGAGGCAA TCTATCAGTT 2 50 

AGCAGACACT CTAGATTCAT ACATTCCAGA GCCAAAGCGT GATATCGATG 300 

ATCCATTCCT ATTACCAATC GAAGATATCT TCTCAATCTC AGGTCGTGGT 350 

ACTGTAGTAA CCGGCCGTGT AGAGCGTGGT ATTGTACACG TAGGTGACGA 4 00 

AGTTGAAATC GTTGGTATTC GTCCAACCAC CAAGACCACT GTAACTGGCG 450 

TTGAAATGTT CCGTAAGTTA CTAGACGAAG GTCGTGCAGG TGATAACGTT 500 

GGTGTTCTAC TACGTGGTAC CAAGCGTGAT GAGGTTGAGC GTGGTCAGGT 550 

TCTAGCTGCT CCAGGCACAA TCACTCCACA CACCAAGTTC ACTGGTCAGG 600 

TTTACGTACT AAGCAAGGAT GAAGGTGGTC GTCACACTCC ATTCTTCAAG 650 

GGCTACCGTC CACAGTTCTT CTTCCGTACA ACCGATATTA CCGGTTCTAT 700 

CGATCTGAAA GAGGGCGTAG AGATGGTAAT GCCAGGTGAT AACACCGACA 750 

TGACCGTAAC CCTAATCCAC CCAGTAGCTA TGGCT 785 



2) INFORMATION FOR SEQ ID NO: 1675 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 835 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Campylobacter jejuni subsp. doylei 

(B) STRAIN: ATCC 49349 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1675 

5 

TGGGGCGATC TTGGTTGTTT CTGCTGCAGA TGGTCCTATG CCACAAACTA 50 

GAGAGCACAT TCTTCTTTCT CGTCAAGTAG GCGTTCCATA TATTGTTGTT 100 

TTTATGAATA AAGCAGATAT GGTTGATGAT GCTGAGCTTT TAGAGTTAGT 150 

TGAAATGGAA ATTAGAGAAT TATTAAGCTC TTATGATTTC CCAGGCGATG 200 

10 ATACACCTAT TATTTCTGGT TCTGCTTTAA AAGCTCTTGA AGAAGCTAAA 250 

GCTGGACAAG ATGGTGAATG GTCAGCAAAA ATTATGGATC TTATGGCTGC 300 

AGTTGATAGT TATATTCCAA CTCCAACTCG TGATACTGAA AAAGACTTCT 350 

TGATGCCAAT TGAAGACGTT TTCTCAATTT CAGGTCGTGG TACTGTTGTT 4 00 

ACAGGTAGAA TTGAAAAAGG TGTTGTAAAA GTAGGTGATA CTATCGAAAT 450 

15 CGTTGGTATT AAAGACACTC AAACAACAAC TGTAACAGGT GTTGAAATGT 500 

TCAGAAAAGA AATGGATCAA GGCGAGGCAG GAGATAACGT AGGTGTTCTT 550 

CTTCGTGGTA CTAAAAAAGA AGAAGTTATT CGCGGTATGG TTCTTGCTAA 600 

ACCAAAATCA ATTACTCCAC ACACTGACTT CGAAGCTGAA GTTTATATCT 650 

TAAATAAAGA TGAAGGTGGT AGACATACTC CATTCTTTAA CAACTATAGA 700 

20 CCACAGTTTT ATGTAAGAAC AACTGATGTT ACAGGTTCGA TTAAATTAGC 750 

TGATGGTGTT GAAATGGTTA TGCCAGGTGA AAATGTGAGA ATTACTGTAA 800 

GCTTGATCGC TCCAGTAGCA CTTGAAGAAG GAACT 835 



25 



35 



2) INFORMATION FOR SEQ ID NO: 1676 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 812 bases 
30 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter lari 

(B) STRAIN: ATCC 43675 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1676 

TTCTGCAGCA GACGGCCCTA TGCCACAAAC TAGAGAGCAT ATCTTACTTT 50 

CTCGTCAAGT AGGTGTACCA TATATTGTTG TTTTCATGAA CAAAGCTGAT 100 

ATGGTTGATG ATGCAGAATT ATTAGAATTA GTTGAAATGG AAATTAGAGA 150 

45 ATTACTAAGC TCTTATGATT TCCCAGGAGA TGATACTCCA ATTATTTCAG 200 

GTTCAGCATT ACAAGCTCTT GAAGAAGCAA AAGCTGGTCA AGATGGTGAA 250 

TGGTCTAAAA AAATCTTAGA TCTTATGGCT GCAGTTGATG ATTATATTCC ,300 

AACTCCGGCT CGTGATACAG ATAAAGATTT CTTGATGCCA ATCGAAGATG 350 

TTTTCTCAAT CTCAGGTCGT GGAACTGTTG TTACCGGTAG AATTGAAAAA 400 

50 GGTGTTGTAA AAGTTGGTGA TACTATAGAA ATCGTTGGTA TTAGAGACAC 450 

TCAAACAACC ACAGTTACTG GTGTTGAAAT GTTTAGAAAA GAAATGGATC 500 

AAGGTGAAGC TGGTGATAAT GTTGGTGTAT TACTTCGTGG AACTAAAAAA 550 

GAAGATGTTG AACGTGGTAT GGTTCTTGCT AAACCAAAAT CAATCACTCC 600 
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ACATACTGAT TTTGAAGCAG AAGTTTATAT CTTAAATAAA GATGAAGGTG 
GTCGTCATAC TCCATTCTTT AATAATTATA GACCGCAATT TTATGTAAGA 
ACAACTGATG TTACAGGTGC TATTAAACTT GCAGAAGGCG TTGAGATGGT 
TATGCCAGGC GATAATGATA GAATTAC TGT AAGTCTTATT GCTCCAGTTG 
CACTTGAGGA AG 
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2) INFORMATION FOR SEQ ID NO: 1677 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter sputorum subsp. sputorum 

(B) STRAIN: ATCC 35980 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1677 

GCTATTCTTG TATGTTCAGC TGCAGATGGT CCAATGCCAC AGACTAGAGA 50 

GCATATTCTA CTATCAAGAC AAGTTGGTGT TCCATACATA GTTGTTTTCT 100 

TAAATAAAGA AGATATGGTT GATGATGCTG AGCTTATAGA GTTGGTTGAA 150 

GTTGAGGTTA GAGATTTATT AAATGAATAT GATTTCCCTG GAGATGATAC 200 

TCCAATCGTA ATAGGTT CTG CTCTTAAAGC TTTAGAAGAA GCAAAAGCTG 250 

GAACAGAGGG TGAATGGTCT GCTAAAATTA TGAAACTTAT GGATGCTGTT 300 

GATAGCTATA TCCCAACTCC AACAAGAGAT ACAGATAAAG ATTTCCTTAT 350 

GCCAATCGAA GATATCTTCT CAATTTCTGG TCGTGGTACA GTTGTAACAG 400 

GTAGAATTGA AAAAGGTGTA GTAAAAGTTG GCGAGACTAT TGAGATAGTT 450 

GGTATTAGAC CTACTCAAAC AACAACAGTT ACTGGTGTTG AAATGTTTAG 500 

AAAAGAGCTA GATCAAGGTG AAGCTGGAGA TAATGTTGGT ATCTTGTTAA 550 

GAGGTACAAA AAAAGAAGAT GTTGAAAGAG GTATGGTTTT ATGTAAACCA 600 

AAATCAATCA CTCCTCACAA GAAATTTGAA GGCGAAGTTT ATATTCTTAC 650 

AAAAGATGAA GGTGGTAGAC ATACTCCTTT CTTTAGTAAC TATAGACCAC 700 

AATTTTATGT TAGAACAACA GATGTAACAG GTTCTATATC TCTTCCTGAG 750 

GGAACAGAGA TGGATATGCC TGGTGATAAT GTAAAACTTA CAGTTGAACT 800 

TATAAACCCA ATTGCTCTTG AGCAAGGA 828 



2) INFORMATION FOR SEQ ID NO: 1678 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter upsaliensis 

(B) STRAIN: ATCC 49815 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1678 



CGATTTTGGT TGTTTCTGCT GCTGATGGTC CTATGCCACA AACTAGAGAG 50 

CACATTTTGC TTTCTCGTCA AGTGGGTGTT CCTTATATCG TAGTTTTTAT 100 

10 GAATAAGGCT GATATGGTTG ATGATGCAGA GCTTTTAGAA TTGGTTGAAA 150 

TGGAAATTAG AGAACTTTTA AGCTCTTATG ATTTCCCGGG CGATGACACT 200 

CCTATCATTT CAGGCTCTGC TCTTCAAGCC TTAGAAGAGG CTAAGGCGGG 250 

ACAAGATGGC GAGTGGTCAG CTAAGATTTT AGAGCTTATG AAGGCAGTTG 300 

ATGAGTATAT CCCAACTCCT GTTCGCGATA CTGAAAAAGA TTTCTTGATG 350 

15 CCTATTGAAG ATGTTTTTTC AATTTCTGGT CGTGGAACTG TTGTAACAGG 400 

TAGAATTGAA AAAGGTGTGG TTAAAGTCGG CGATACTATT GAGATAGTAG 450 

GTATCAAAGA TACTCAAACT ACAACAGTTA CAGGCGTTGA GATGTTTAGA 500 

AAAGAAATGG ATCAAGGTGA GGCTGGCGAT AATGTCGGTG TGCTTTTAAG 550 

AGGAACAAAA AAAGAAGATG TTCTTCGTGG TATGGTTCTT GCAAAGCCTA 600 

20 AATCTATCAC TCCTCATACT GATTTTGAAG CAGAAGTTTA TATTCTAAAT 650 

AAAGATGAGG GCGGTCGCCA TACTCCTTTC TTTAACAATT ATCGTCCGCA 700 

GTTTTATGTA AGAACGACTG ATGTAACTGG TTCTATTAAA TTAGCTGATG 750 

GTGTTGAGAT GGTTATGCCG GGTGAAAATG TAAGAATTAC AGTTAGCCTT 800 

ATCGCTCCAG TTGCACTTGA 820 

25 



2) INFORMATION FOR SEQ ID NO: 1679 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 783 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Globicatella sanguis 
40 (B) STRAIN: ATCC 51173 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1679 

CTGCAGCTGA TGGTCCAATG CCTCAAACAC GTGAACATAT CTTATTATCA 50 

45 CGTCAAGTAG GTGTTCCTTA CATGGTTGTC TTCTTAAACA AAGTTGACAT 100 

GGTTGACGAT GAAGAATTAT TAGAATTAGT TGAAATGGAA GTTCGTGATT 150 

TATTATCTGA ATACGAATTC CCTGGAGACG ACGTTCCAGT AATCGCTGGT 200 

TCAGCTTTAA AAGCTTTAGA AGGCGAAGAA CAATATGAAG CAAAAGTATT 250 

AGAATTAATG GAAGCTGTAG ATACATACAT TCCAGAACCA GTTCGTGATA 300 

50 CTGAAAAACC ATTCATGATG CCAGTTGAAG ATGTGTTCTC AATCACAGGT 350 

CGTGGTACAG TTGCTACTGG ACGTGTTGAA CGTGGACAAG TTAAAGTTGG 400 

TGACGAAGTT GAAATCGTTG GTATTGCTGA AGAAATTAGC AAAACAACTG 450 

TAACTGGTGT TGAAATGTTC CGTAAATTAT TAGATTACGC TGAAGCTGGA 500 
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GATAACATTG GTGCGTTATT ACGTGGTGTT ACACGTGAAC AAATCCAACG 
TGGTCAAGTA TTAGCAAAAC CAGGAACAAT TACACCTCAT ACTAAATTCG 
AGGCGGAAGT TTACGTATTA TCAAAAGAAG AAGGTGGACG TCATACTCCA 
TTCTTCGCTA ACTACCGTCC TCAATTCTAC TTCCGTACAA CTGACGTTAC 
AGGTGTTGTA GAATTACCAG AAGGTACAGA AATGGTAATG CCTGGAGATA 
ACGTATCAAT GACAGTTGAA TTAATTCACC CAA 
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2) INFORMATION FOR SEQ ID NO: 1680 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactobacillus acidophilus 

(B) STRAIN: ATCC 4356 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1680 

GCTATCTTAG TTGTTGCTGC AACTGATGGT CCTATGCCAC AAACTCGTGA 50 

ACACATTTTG CTTGCTCGTC AAGTTGGTGT TAACTACATC GTAGTATTCT 100 

TGAACAAGTG CGATTTAGTT GACGACCCAG AATTGATCGA CTTGGTTGAA 150 

ATGGAAGTTC GTGACTTGTT GACTGAATAC GATTACCCTG GTGATGATAT 200 

TCCAGTTGTT CGTGGTTCAG CATTAAAGGC TTTACAAGGT GACAAGGAAG 250 

CTCAAGACCA AATCATGAAG TTGATGGACA TTGTTGATGA ATACATCCCA 3 00 

ACTCCAGAAC GTCAAACTGA CAAGCCATTC TTGATGCCAG TTGAAGACGT 350 

ATTCACTATC ACTGGTCGTG GTACTGTTGC TTCAGGTCGT ATCGACCGTG 400 

GTACTGTTAA GGTCGGTGAC GAAGTTGAAA TCGTTGGTTT GGTAGATAAA 450 

GTTCTTAAGT CAGTTGTTAC TGGTTTGGAA ATGTTCCACA AGACTTTGGA 500 

CTTAGGTGAA GCCGGCGATA ACGTTGGTGT ATTGCTTCGT GGTGTTGACC 550 

GTGATCAAGT TGTTCGTGGT CAAGTATTGG CTGCACCCGG CTCAATCCAA 600 

ACTCATAAGA AGTTTAAGGC ACAAGTTTAT GTTTTGAAGA AGGACGAAGG 650 

TGGTCGTCAC ACTCCATTCT TCTCAGACTA CCGTCCACAA TTCTACTTCC 700 

ACACCACTGA TATTACTGGT GAAATTGAAT TGCCAGAAGG TACTGAAATG 750 

GTTATGCCTG GTGATAACAC TGAATTCACT GTTACTTTGA TCAAGCCAGC 800 

TGCCATCGAA AAGGGTACTA AGT 823 



2) INFORMATION FOR SEQ ID NO: 1681 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 793 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leuconostoc mesenteroides subsp. 
dextranicum 

(B) STRAIN: ATCC 19255 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1681 

GCTGCAACTG ATGGTCCTAT GCCACAAACA CGTGAACACA TCTTGTTGGC 50 

ACGTCAAGTT GGTGTTGACT ACTTGGTTGT CTTCTTGAAC AAGACTGATT 100 

TGGTTGATGA TGAAGAATTA GTTGAATTGG TTGAAATGGA AGTTCGTGAA 150 

TTGTTGTCAG AATATGACTT CCCAGGTGAC GATATTCCTG TACTTAAGGG 200 

TTCAGCTTTG AAGGCTTTGG AAGGTGATCC TGAACAAGTT AAGGTTATCG 250 

AAGAATTGAT GGATACTGTT GATTCATACA TTCCAGAACC AGCACGTGAA 3 00 

ACAGACAAGC CATTCTTGAT GCCTGTCGAA GACGTCTTCA CAATCACTGG 350 

TCGTGGTACA GTTGCTTCTG GTCGTGTTGA CCGTGGTGTA TTGACTACAG 400 

GAACTGAAAT TGAAATCGTT GGTTTGAAGG ACGAAGTTCA AAAGACTACT 450 

GTTACAGGTA TCGAAATGTT CCGTAAGACT TTGGAAGAAG CTCAAGCGGG 500 

TGATAACATT GGTGCATTGT TGCGTGGTGT TGATCGTAGC AACATTGAAC 550 

GTGGTCAAGT TTTGGCAAAG CCAGGTTCAA TTAAGACACA CAAGAAGTTC 600 

AAGGCTGAAG TTTATGTCTT GACAAAGGAA GAAGGTGGTC GTCATACACC 650 

ATTCTTCACT AACTACCGTC CACAATTCTA CTTCCACACA ACTGATGTTA 700 

CAGGTGTTGT TGAATTGCCA GCCGGTGTTG AAATGGTTAT GCCTGGTGAC 750 

CAAGTGACAT TCGAAATCGA ATTGATCTCA CCAGTTGCCA TCG 793 



2) INFORMATION FOR SEQ ID NO: 1682 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Prevotella buccalis 

(B) STRAIN: ATCC 35310 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1682 

TATCCTTGTA GTTGCTGCTA CTGATGGTCC TATGCCACAG ACACGTGAGC 50 

ACGTGCTTTT GGCTCGTCAG GTAAACGTTC CTCGTTTGGT TGTGTTCATG 100 

AACAAGTGTG ACTTGGTAGA AGACGAAGAG ATGCTTGAAC TCGTTGAAAT 150 

GGAGTTGCGC GAACTTCTTG AGCAATACGA ATTCGAAGAG GATACTCCAA 2 00 

TCGTTCGTGG TTCTGCACTG GGTGCATTGA ATGGTGTTGA CAAGTGGGTT 250 

GACAGCGTGA TGACGTTGAT GGACACTGTT GACGAGTGGA TTCAAGAGCC 3 00 

AGAGCGTGAC CTTGACAAAC CTTTCTTGAT GCCAGTAGAG GACGTGTTCT 3 50 

CTATCACAGG TCGTGGTACC GTTGTAACAG GACGTATTGA GACTGGTAAG 400 

GTAAAGGTTG GCGACGAGAT TCAGTTGCTC GGTCTTGGTG AGGACAAGAA 450 
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GTCTGTTGTA ACAGGCGTTG AAATGTTCCG 
AAGCAGGTGA TAACGTAGGA CTTTTGCTCC 
GTAAAGCGTG GTATGGTTGT TGTACACCCA 
TCACTTCAAG GCTTCCATCT ATGTATTGAA 
5 ATACTCCATT CGGAAACAAG TATCGTCCTC 
GACTGTACAG GTGAGATCAC TTTGCCAGAA 
TGGTGACAAC GTAGAGATTG AGGTTACCTT 
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TAAGATTCTT TCTGAAGGTG 500 

GCGGTATCGA TAAGGATGAA 550 

GGTGCCATCA CTCCTCACGA 600 

GAAGGAAGAG GGTGGACGTC 650 

AGTTCTATCT CCGTACAATG 700 

GGCGTAGAGA TGGTGATGCC 750 

GATTTACAAG GTTGCC 796 



10 



20 



2) INFORMATION FOR SEQ ID NO: 1683 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 800 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ruminococcus bromii 

(B) STRAIN: ATCC 27255 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1683 

GGTTGCTGCT ACTGACGGCC CGATGCCTCA GACTCGTGAG CACGTTCTGC 50 

TCGCTCGTCA GGTGGGTGTG CCCGCCATCG TCGTCGCCCT CAACAAGTGC 100 

GACATGGTTG ACGATGAGGA GCTCATTGAG CTTGTCGAGA TGGAGGTCCG 150 

30 CGAGCTGCTG ACCTCGCAGG AGTTCGACGG CGACAACTGC CCTGTCGTTC 200 

GCATCTCCGC CTTCCAGGCC CTCCAGGGTG ACGAGAAGTG GACCCAGTCG 250 

ATCCTCGACC TCATGGACGC CGTGGACGAG TACATCCCGC AGCCTGAGCG 300 

CGATCTCGAC AAGCCCTTCC TTATGCCGAT CGAGGACGTC TTCACCATCA 350 

CCGGCCGTGG CACCGTTGTC ACCGGTCGTG TCGAGCGTGG TGTCGTCAAG 400 

35 ACTGGCGAAG AGGTCGAGAT CGTCGGTATC CACGAGAAGA CCCAGAAGAC 450 

CACCGTTACC GGTGTCGAGA TGTTCCGTAA GATCCTCGAC GAGGGCCGCG 500 

CTGGTGAGAA CGTCGGCGTT CTGCTCCGTG GCACCAAGAA GGAGGACGTC 550 

GTTCGCGGCA TGGTCCTCTC CAAGCCTGGT TCCACCACCC CCCACACCGA 600 

CTTCGAGGGC CAGGTCTACG TCCTCAAGAA GGATGAGGGT GGCCGCCACA 650 

40 AGCCGTTCTT CTCCCATTAC AGCCCCCAGT TCTACTTCCG TACCACGGAC 700 

GTGACTGGCA CTGTTGAGCT CCCCGAGGGC ACCGAGATGG TCATGCCTGG 750 

CGACAACACC GACATGACTG TGCACCTGAT TCACCCGGTT GCCATGGAGG 800 



45 

2) INFORMATION FOR SEQ ID NO: 1684 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 545 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Paracoccidioides brasiliensis 
5 (B) STRAIN: ATCC 32075 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1684 

TGTCTTTATT CAGGAACTGA TTGTATGTTT CTTCTCGTTT ATATATAACA 50 

10 TACCTTCTAT ATTTCATGTG TTTCTAACGA ACTCATAGAA CAACATTGCC 100 

AAGGCTCACG GTGGTTACTC CGTGTTCACC GGTGTCGGTG AGCGTACCCG 150 

TGAAGGAAAC GATCTGTACC ACGAAATGCA GGAAACCCGC GTCATCCAGC 200 

TGGACGGCGA GTCCAAAGTC GCCCTCGTCT TCGGCCAGAT GAACGAGCCC 250 

CCCGGAGCCC GTGCCCGTGT TGCCCTGACC GGTCTGACCA TCGCTGAATA 300 

15 CTTCCGTGAC GAAGAAGGCC AAGATGGTAC GTTCCCCCAT TCCATATATG 350 

TTTCTTGTGC GCTTTGCCAA CTAAACACCA CCTAGTGCTC CTCTTCATCG 400 

ACAATATCTT CCGCTTCACC CAAGCCGGTT CCGAAGTGTC CGCCCTGCTA 4 50 

GGCCGCATCC CCTCCGCCGT CGGCTATCAA CCCACCCTCG CCGTCGACAT 500 

GGGTGGTATG CAGGAGCGCA TCACAACTAC AACAAAAGGC TCCAT 545 

20 



2) INFORMATION FOR SEQ ID NO: 1685 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1020 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) .TOPOLOGY: Linear. 



30 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida norvegica 
35 (B) STRAIN: ATCC 36586 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1685 

GATATCGCTT TATGGAAATT CGAAACTCCA AAATTCCACG TTACYGTTAT 50 

40 CGATGCTCCA GGTCACAGAG ATTTCATCAA GAATATGATT ACYGGTACCT 100 

CCCAAGCTGA TTGTGCTATT TTAATCATTG CTGGTGGTGT TGGTGAATTC 150 

GAAGCTGGTA TCTCCAAAGA TGGTCAAACC AGAGAACACG CTTTGTTAGC 200 

TTTCACCTTA GGTGTYAAAC AATTGATTGT TGCCGTTAAC AAAATGGACT 250 

CTGTCAAATG GGATCAATCC CGTTTCGAAG AAATCGTCAA GGAAGCTTCC 300 

45 GGTTTCATCA AGAAAGTTGG TTACAACCCA AAGACTGTTC CATTCGTTCC 350 

AATCTCTGGT TGGAATGGTG ACAACATGAT TGAAGTYTCW GCTAACGCYC 4 00 

CATGGTACAA AGGTTGGGAA AAGGAAACCA AAGCYGGTGT CGTTAAAGGT 450 

AAAACTTTAT TAGAAGCCAT TGATGCTATT GAACCACCTT CAAGACCAAC 500 

TGAAAAACCA TTGAGATTGC CATTGCAAGA TGTCTACAAG ATTGGTGGTA 550 

50 TCGGAACCGT ACCAGTCGGT ARAGTTGAAA CCGGTGTCAT TAAACCAGGT 600 

ATGATTGTTA CTTTCGCCCC AGCCGGTGTT ACTACTGAAG TCAAATCTGT 650 

TGAAATGCAT CACGAACAAT TAGAAGCTGG TTACCCAGGT GACAATGTTG 700 

GTTTCAACGT CAAGAATGTT TCAGTTAAAG AAATCAGAAG AGGTAAHGTT 750 
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GCTGGTGACT CCAAGAACGA TCCACCAAAA 

TCAAGTTATT GTCTTGAACC ATCCAGGTCA 

CAGTTTTGGA TTGTCACACT GCCCACATTG 

TTGGAAAAGA TTGACAGAAG ATCCGGTAAG 

5 ATTCATCAAA TCTGGTGACG CTGCTAWTGT 

CATTRTGTGT TGAAGCTTTC 
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GGTGCTGAAT CTTTCAACGC 800 
AATCTYTGCT GGTTACTYTC 850 
CTTGTAAATT CGATGAAATC 900 
AAATTGGAAG AAAATCCAAA 950 
CAAATTTGTT CCATWTAAAC 1000 

1020 



10 2) INFORMATION FOR SEQ ID NO: 1686 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 929 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus nidulans 

(B) STRAIN: WSA-176 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1686 



TGCTGCTTCC GATGGTCAAA TGTACGATTG ATATTCCTTC CAGCCAGTCA 50 

GGATAACAGC TGATACCAGT TGCAAATAGG CCCCAGACTC GTGAGCACTT 100 

GTTGCTTGCC CGTCAGGTTG GTGTCCAGAA GATCGTTGTC TTCGTCAACA 150 

• AGGTTGACGC TGTCGATGAC CCTGAGATGT TGGAGCTTGT TGAGCTCGAG 200 

30 ATGCGTGAGC TCCTCAACAC TTACGGTTTC GAGGGAGAGG AGACCCCTAT 250 

CATCTTCGGT TCCGCCCTGT GCGCTCTCGA AGGCCGCCGC GAGGACATTG 300 

GTACTCAGCG TATTGACTCC CTCCTCGAGG, CCGTTGACAC TTGGATCCCT 350 

ACCCCCCAGC GTGACTTGGA CAAGCCCTTC CTGATGTCCA TTGAGGAAGT 400 

TTTCTCCATT GGTGGTCGTG GTACCGTCGC CTCTGGTCGT GTCGAGCGTG 450 

35 GTCTCCTCAA GAAGGATACC GAAGTTGAAA TTCACGGTGC TGATGGTATT 500 

CTGAAGACCA AGGTCACCGA CATTGAGACC TTCAAGAAGA GCTGCGATGA 550 

GTCTCGTGCT GGTGACAACT CCGGTCTTCT CCTCCGTGGT ATCCGTCGTG 600 

AGGATGTTCG TCGTGGTATG GTCATCGCTG CCCCTGGCTC CATCAAGGCC 650 

TCCAAGAAGT TCATGGTCTC CATGTACGTC TTGACTGAGG CTGAAGGTGG 700 

40 CCGCAAGAAC GGCTTCGGTG CCAACTACCG CCCCCAGGCT TTCATCCGCA 750 

CTGCTGGTAA GTTTCGAACT ATTTGATTCA TTGATCACGT CCCTAACTGT 800 

TACTTTAGAC GAGGCTTGCG ACCTTCATTT CCCTGATGAG GCCGACAAGG 850 

ACCGCCACGT CATGCCCGGT GACAACGTCG AAATGGTCCT CAACCTCAAC 900 

AACCCCGTTG CTGCTGAGGC TGGACAGCG 929 

45 



2) INFORMATION FOR SEQ ID NO: 1687 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 951 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus terreus 

(B) STRAIN: WSA-174 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1687 



TGCCGCTTCC GATGGTCAGA TGTACGCTCA AGCCCCAGTT TCCATATAAA 50 

CATAAACGAT CTATCATCAG CACAACGCTG ACTTCTTCGC TTCCAGGCCC 100 

CAGACCCGTG AGCACTTGCT GTTGGCCCGT CAGGTCGGTG TCCAGAAGAT 150 

CGTGGTCTTC GTCAACAAGG TCGATGCCGT TGATGACCCG GAGATGTTGG 200 

15 AGCTCGTTGA GCTGGAAATG CGCGAGCTCC TGACCAGCTA CGGATTCGAG 250 

GGTGAAGAGA CCCCCATCAT CTTCGGTTCT GCTCTCTGCG CTCTTGAGGG 300 

CCGCCGTCCT GAGATTGGTA CTGAGAAGAT TGACGAGCTG ATGCACGCCG 350 

TCGACACCTG GATCCCCACC CCCCAGCGTG ACCTCGACAA GCCCTTCCTG 400 

ATGTCCGTCG AGGAAGTCTT CTCCATTGCT GGTCGTGGTA CCGTCGCTTC 450 

20 CGGCCGTGTC GAGCGTGGTA TTCTGAAGAA GGATAGCGAA GTCGAGATCA 500 

TCGGTGGTGC TTTCGACGCC ACGAAGACCA AGGTCACTGA CATCGAGACC 550 

TTCAAGAAGT CTTGCGACGA GTCTCGCGCT GGTGACAACT CTGGTCTCCT 600 

CCTCCGTGGT ATCCGTCGTG AGGATGTTCG GCGTGGTATG GTCATTGCTG 650 

CTCCTGGCAG CACCAAGGCC CACGACAAGT TCCTTGTCTC TATGTACGTC 700 

25 CTCACTGAGG CTGAGGGTGG CCGTCGTACC GGATTCGGTA CCAACTACCG 750 

CCCCCAGGTC TTCATCCGTA CTGCCGGTAA GTGTTCCTGG AAGAGGCTTT 800 

GAGCCTATAT AGGATCTCGG ATAATTTACT AATCCACCAT ATAGATGAGG 850 

CCGCTGACCT CAGCTTCCCC GACAACGATG ACTCCCGCCG TGTCATGCCC 900 

GGTGACAACG TTGAGATGGT GCTGAAGACC CACCGCCCCG TGGCTGCTGA 950 

30 G 951 



2) INFORMATION FOR SEQ ID NO: 1688 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
40 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
45 (A) ORGANISM: Candida norvegica 

(B) STRAIN : ATCC 36586 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1688 

50 CGTTGCCGCT ACCGATGGTC AAATGCCTCA AACTAGAGAA CATTTGCTAT 50 

TGGCTAGACA GGTTGGTGTT CAACACATTG TCGTGTTTGT TAACAAGGTT 100 

GACACTATTG ATGATCCAGA AATGTTGGAA TTGGTTGAAA TGGAAATGAG 150 

AGAGTTGATT GCCACTTATG GTTTCGATGG TGATAACACC CCAGTTATCA 200 
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TGGGTTCTGC 
CAATCAATCG 
AACTAGAGAT 
CCATTTCTGG 
TTGAAGAAAG 
CAAAACCACC 
CTATGGCTGG 
GATATCAAGA 
CACCAAATTC 
GTCATTCAGG 
GGTGATGTCA 
GCAAGTCTTA 
CAACTGCTCT 



TCTATGTGCT 
ACAGATTGTT 
TTGGAAAAAC 
TAGAGGTACC 
GTGATGAAAT 
GTCACCGGTA 
TGATAACTGT 
GAGGTATGGT 
TTGGCCTCTT 
TTTTGCTGAA 
CTGTTACTTT 
CCAGGTGACA 
TGAAACCGGT 



TTGGAAGGTC 
GGAAGCCGTT 
CATTCTTGAT 
GTCTGTACCG 
CGAAATTGTC 
TTGAGATGTT 
GGTATCTTAT 
TATCTCTAAA 
TATACGTCTT 
AACTACAGAC 
AACCTTCCCA 
ACGTTGAAAT 
CAA 



GTGAACCTGA 
GATGAATACA 
GGGTGTTGAA 
GTCGTGTTGA 
GGCTACAACA 
CAAAAAGGAA 
TACGTGGTGT 
GTCAACACCG 
GACTAAAGAA 
CTCAATTGTT 
GAAGATGCTG 
GGAATGTACC 



AATCGGTGCT 
TTCCAACTCC 
GATGTCTTCT 
AAGAGGTAAC 
AGACTCCAAT 
TTAGACCAAG 
TAAGAGAGAT 
TTTCCGCACA 
GAAGGTGGTC 
CATCAGAACC 
ATCACTCTCA 
TTGGTTCATC 



250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
823 



10 



15 



20 



25 



30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1689 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
<D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida parapsilosis 

(B) . STRAIN: ATCC 201076 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1689 

GCTGCTACTG ACGGTCAAAT GCCTCAAACT AGGGAACATA TGTTGTTGGC 50 

GAGACAAGTT GGTATCCAAA ACTTGGTTGT TTTTGTTAAC AAAGTTGATA 100 

CCATTGATGA CCCAGAAATG TTGGAATTGG TTGAAATGGA AATGAGGGAA 150 

TTATTGAGCT CTTATGGGTT TGATGGTGAA AACACTCCAG TTATCATGGG 200 

ATCAGCCTTG TGTGCTTTAG AAGGTAAACA ACCAGAAATC GGTGTTCAAG 250 

CCATTCAAAA ATTATTGGAT GCTGTTGATG AATATATTCC AACTCCAGAA 300 

AGAGATGCTG ACCAACCATT TTTGATGCCA GTGGAAGATG TGTTTTCTAT 350 

TTCAGGTAGA GGAACCGTTG TCACCGGAAG AGTTGAAAGA GGTATGTTGA 400 

AGAAAGGTGA AGAAGTAKAA GTCATTGGTG AAAACTCATT TAAGGCTACT 4 50 

TCCACGGGTA TTGAGATGTT CAAAAAGGAA TTGGATGCCG CTATGGCCGG 500 

TGACAACTGT GGTATTTTGT TGAGAGGTGT CAAGAGAGAC GAAGTCAAGA 550 

GGGGTATGGT TTTGGCCAAA CCAGGTACCA CCACCCCACA CCAAAAGTTT 600 

TTGGCTTCCA TTTATATCTT GACTGCTGAA GAAGGTGGAC GTAGTACCCC 650 

TTTCAGTGAA GGATACAAAC CACAATGTTT CTTTAGAACT AGTGATGTTA 700 

CCACGACATT TACTTTCCCA GAAGGTGAAG GTGTTGACCA CTCACAAATG 750 

GTTATGCCAG GAGRCAATGT TGAAATGGTG GGAACTTTAA TCAAGAAAGC 800 

TCC 803 



2) INFORMATION FOR SEQ ID NO: 1690 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1690 
CAGGTCCTGT TGCGACTGAA GAA 



2) INFORMATION FOR SEQ ID NO: 1691 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii)- MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1691 
CACAGATAAA CCTGAGTGTG CTTTC 



2) INFORMATION FOR SEQ ID NO: 1692 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1692 
GGTGAGAACT GTGGTATCTT ACTT 



2) INFORMATION FOR SEQ ID NO: 1693 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

878 
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(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1693 



PCT/CA00/01150 



CATTTCAACG CCTTCTTTCA ACTG 



24 



10 



15 



20 



2) INFORMATION FOR SEQ ID NO: 1694 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1694 
AAGGCAAGGA TGACAACGGC 



20 



25 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1695 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1695 
ACGATTTCCA CTTCTTCCTG G 



21 



40 



45 



50 



2) INFORMATION FOR SEQ ID NO: 1696 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1696 

879 
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ATGTTCCTGT AGTTGCTGGA 



2) INFORMATION FOR SEQ ID NO: 1697 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1697 
TTTCTTCAGC AATACCAACA AC 



2) INFORMATION FOR SEQ ID NO: 1698 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1698 
GGAATCAACA GATGGTTTAC AAA 



2) INFORMATION FOR SEQ ID NO: 1699 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1699 
GCATCTTCTG GGAAAGGTGT 
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2) INFORMATION FOR SEQ ID NO: 1700 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1700 
AAGATGCGGA AAGAAGCGAA 



2) INFORMATION FOR SEQ ID NO: 1701 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1701 
ATTATGGATC AGTTCTTGGA TCA 



2) INFORMATION FOR SEQ ID NO: 1702 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 213 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus gordonii 

(B) STRAIN: Challis V288 

(C) ACCESSION NUMBER: L20574 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1702 

TTCATAGACG CTGAGCACGC TTTGGATCCA TCTTACGCGG CTGCTCTAGG 
TGTAAATATT GATGAGCTGT TGCTATCTCA ACCAGATTCT GGTGAGCAAG 
GTTTAGAAAT TGCAGGAAAA TTGATTGACT CTGGGGCAGT TGATTTAGTT 

881 
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GTCATCGACT CTGTTGCAGC TCTTGTACCA CGTGCGGAAA TCGATGGAGA 200 
TATCGGTGAT AGC 213 



2) INFORMATION FOR SEQ ID NO: 1703 



10 



15 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mutans 

(B) STRAIN: strain GS-5 

(C) ACCESSION NUMBER: M61897 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1703 



GGGCCGGAAT 
GGCGCAAAAA 

25 TTGATCCAGC 
CTTTCACAAC 
GATTGATTCT 
TAGTACCACG 
TTACAAGCAC 

3 0 C AAT AAAACA 
TTGGTATTAT 
AAGTTTTATT 
AGGAACCGGG 
AAGTTGTTAA 

35 ATTATATATG 
CAGTGATTTG 



CTTCTGGTAA 
GATGGCGGTA 
CTATGCTGCT 
CAGATTCAGG 
GGCGCTGTTG 
TGCGGAGATT 
GCATGATGAG 
AAAACCATTG 
GTTTGGTAAT 
CTTCTGTGCG 
GAACAAAAAG 
AAATAAAGTT 
GTGAAGGCAT 
GGAATTATCC 



GACAACTGTC 
TTGCCGCTTT 
GCTCTTGGCG 
AGAACAGGGT 
ATTTAGTTGT 
GACGGAGATA 
TCAAGCGATG 
CTATTTTTAT 
CCAGAAACAA 
TCTTGATGTC 
ACAGCAATAT 
GCTCCACCAT 
TTCTCGTACA 
AAAAAGCTGG 



GCTCTTCATG 
CATTGATGCA 
TTAATATTGA 
CTTGAAATTG 
TGTTGACTCA 
TTGGTAATAG 
CGTAAATTAT 
TAATCAATTG 
CCCCTGGCGG 
CGCGGCAATA 
TGGTAAAGAG 
TTAAGGAAGC 
GGTGAATTAG 
AGCTTGGTAC 



CTGCTGCTCA 
GAACATGCCC 
TGAGCTTTTG 
CAGGGAAATT 
GTGGCAGCTT 
TCATGTTGGC 
CAGCTTCAAT 
CGGGAAAAAG 
GCGTGCCTTG 
CTCAAATTAA 
ACCAAAATTA 
TTTTGTAGAA 
TTAAGATTGC 
TC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
692 



40 2) INFORMATION FOR SEQ ID NO: 1704 



SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1204 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
(C) ACCESSION NUMBER: Z17307 



(i) 

45 
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ATGGCGAAAA 
AGAACGTGAA 
5 TTGGTAAAGG 
CAAGTGATGA 
TGGTTATCCT 
GTAAGACAAC 
GGGATTGCTG 

10 TGCGGCCCTT 
CAGGAGAGCA 
GTTGATCTTG 
AATTGATGGA 
TGAGCCAGGC 

15 ATTGCCATTT 
AAATCCAGAA 
TCCGCTTGGA 
AAAGAAACCA 
GGTAGCTCCA 

20 GAATTTCTAA 
ATCAAAAAAG 
AGGTTCTGAG 
ATGAAATTGA 
GAAGTTTCAG 

2 5 AGAAGCAGTG 
TCGAAATTGA 
TCGA 



AACCAAAAAA 
AAGGCCTTGA 
ATCAATCATG 
GCTCAGGTTC 
AAGGGACGTA 
GGTTGCCCTT 
CCTTTATCGA 
GGTGTCAATA 
AGGTCTTGAG 
TCGTAGTCGA 
GATATCGGAG 
CATGCGTAAA 
TTATCAACCA 
ACAACACCGG 
TGTTCGTGGT 
ATGTCGGTAA 
CCGTTTAAGG 
GACTGGTGAG 
CAGGGGCTTG 
AATGCTAAGA 
TAAGCAAGTC 
AACAAGATAC 
AATGAAGAAG 
AGAATAAGCT 



ATTAGAAGAA 
ATGACGCTCT 
CGTTTGGGTG 
TTTAGCTCTT 
TCATCGAAAT 
CATGCAGTTG 
TGCGGAACAT 
TTGACGAATT 
ATTGCGGGAA 
CTCAGTTGCT 
ATAGCCATGT 
CTTGGCGCCT 
ATTGCGTGAA 
GCGGACGTGC 
AATACACAAA 
AGAAACTAAG 
AAGCCGTAGT 
CTTTTGAAGA 
GTATTCTTAC 
AATACTTGGC 
CGTTCTAAAT 
TGAAAACAAA 
TTCCGCTTGA 
GTTAAAGCAG 



ATTTCAAAAA 
TAAATTGATT 
AACGTGCGGA 
GACATTGCCC 
CTATGGCCCA 
CACAAGCGCA 
GCCCTTGATC 
GCTCTTGTCT 
AATTGATTGA 
GCCCTTGTTC 
TGGTTTGCAG 
CTATCAATAA 
AAAGTTGGAG 
TTTGAAATTC 
TTAAGGGAAC 
ATTAAGGTTG 
TGAAATTATG 
TTGCAAGCGA 
AAAGATGAAA 
AGAGCACCCA 
TTGGCTTGAT 
AAAGATGAGC 
CTTAGGCGAT 
TGGAGAAATC 



AATTTGGGGC 
GAGAAAGACT 
GCAAAAGGTG 
TTGGCTCAGG 
GAGTCATCTG 
AAAAGAAGGT 
CAGCTTATGC 
CAACCAGACT 
CTCAGGTGCA 
CTCGTGCGGA 
GCTCGTATGA 
AACCAAAACA 
TGATGTTTGG 
TATGCTTCAG 
TGGTGATCAA 
TAAAAAATAA 
TACGGAGAAG 
TTTGGATATT 
AAATTGGGCA 
GAAATCTTTG 
TGATGGAGAA 
CAAAGAAAGA 
GAACTTGAAA 
CGCTACTTTT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1050 
1100 
1150 
1200 
1204 



30. 



2) INFORMATION FOR SEQ ID NO: 1705 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 981 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



45 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pyogenes 

(B) STRAIN: NZ131 

(C) ACCESSION NUMBER: U21934 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1705 



ATGCGTTCAG 
TCCTAAAGGA 
50 CGACTGTGGC 
GCAGCCTTTA 
GCTTGGGGTT 
AACAAGGACT 



GAAGTCTAGC 
CGTATCATCG 
TTTACATGCT 
TCGATGCCGA 
AATATTGATG 
TGAAATTGCA 



TCTTGATATT 
AAATCTATGG 
GTAGCACAAG 
GCATGCGCTT 
AACTTCTCTT 
GGTAAATTGA 



GCTTGGATAG 


CTGGTGGTTA 


50 


TCCAGAGTCT 


TCCGGTAAAA 


100 


CTCAAAAAGA 


AGGTGGAATC 


150 


GATCCAGCTT 


ATGCTGCTGC 


200 


GTCTCAACCA 


GATTCTGGAG 


250 


TTGATTCTGG 


TGCGGTTGAC 


300 



883 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



10 



CTGGTTGTTG 
TGGTGATATT 
AGGCCATGCG 
ATCTTTATCA 
TGAAACAACA 
TGGATGTGCG 
GCCAGCATTG 
TCCGCCATTT 
CTCGTACAGG 
AAAGCAGGTG 
TGAAAATGCT 
TCGACCGTAA 
GAATCTGCTA 
TTTAGATAAT 



TCGATTCAGT 
GGCGATAGCC 
TAAATTATCA 
ACCAATTGCG 
CCAGGTGGTC 
TGGAAACAAC 
GTAAGGAGAC 
AAGGTAGCAG 
GGAGCTTGTG 
CTTGGTTCTC 
AAGCGTTATT 
AGTACGTGTT 
TGGCAGTAGC 
GGTATTGAAA 



AGCAGCTTTA 
ATGTCGGATT 
GCTTCTATTA 
TGAAAAAGTT 
GAGCTTTGAA 
CAAATTAAAG 
CAAAATCAAG 
AAGTTGAAAT 
AAAATTGCTT 
TTATAATGGT 
TGGCCGATCA 
AAATTTGGTT 
ATCAGAAGAA 
TTGAAGATTA 



GTGCCACGTG 
GCAAGCACGT 
ATAAAACAAA 
GGTGTGATGT 
ATTCTATGCT 
GAACTGGTGA 
GTTGTTAAAA 
CATGTATGGG 
CTGATTTGGA 
GAGAAGATTG 
TCCACAATTG 
TGCTTGAAGA 
ACCGATGATC 
A 



CTGAAATTGA 
ATGATGAGTC 
AACTATCGCA 
TTGGAAATCC 
TCTGTTCGGC 
CCAAAAGATA 
ACAAGGTCGC 
GAAGGTATTT 
CATTATCCAA 
GCCAAGGTTC 
TTTGATGAAA 
AAGCGAAGAA 
TTGCTTTAGA 



350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
981 



15 



2) INFORMATION FOR SEQ ID NO: 1706 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus salivarius subsp 
30 thermophilics 

(C) ACCESSION NUMBER: M94062 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1706 



35 GCGTATGCAC GAGCTCTAGG TGTTAATATC 

GCCTGATTCT GGTGAGCAAG GTCTCGAAAT 

CTGGTGCAGT GGATTTAGTT GTTGTTGACT 

CGTGCAGAAA TTGATGGAGA TAGTGGTGAC 

GCGTATGATG AGTCAAGCCA TGCGTAAACT 

40 CAAAAACGAT TGCTATCTTT ATTAACCAGT 

ATGTTTGGTA AC 



GATGAGCTTC 


TTTTGTCGCA 


50 


TGCAGGTAAG 


CTGATTGACT 


100 


CAGTTGCGGC 


CTTCGTACCA 


150 


AGTCATGTAG 


GACTTCAAGC 


200 


TTCTGCATCT 


ATTAATAAAA 


250 


TGCGTGAAAA 


AGTTGGTATC 


300 






312 



45 2) INFORMATION FOR SEQ ID NO: 1707 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 831 bases 

(B) TYPE: Nucleic acid 
50 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: J02967 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1707 



10 



15 



20 



ATGAAAAACA 
TATTATCTAC 
CTCCATTATT 
ACAAACGCTG 
GGCACCAGAT 
CGGAAATAAT 
GGAATGGAGA 
TTCTGTTGTT 
AAATCAAGAA 
GGAGATAAAG 
CTTAAAAATT 
ATCACAATCT 
ATGTATCTAC 
TGCAGGATTC 
TTATTATAAG 
GGAAACTTGG 
GATCACCATT 



CAATACATAT 
AGCAGCGCCA 
TGAAGGAACT 
AAATTGCTCA 
TCAACTTTCA 
AGATCAGAAA 
TCTGGAACAG 
TGGGTTTCGC 
TTATCTCAAA 
AAAGAAACAA 
TCACCAGAAG 
CCCAGTTAAA 
AAGATCTGGA 
ACAGCAAATA 
CAAATCAGGA 
GGTCGAATTT 
CTAAACACAC 



CAACTTCGCT 
GTGCATCAAC 
GAAGGTTGTT 
ATTCAATAAA 
AGATCGCATT 
ACCATATTCA 
CAATCATACA 
AAGAAATAAC 
GATTTTGATT 
CGGATTAACA 
AACAAATTCA 
AACTCAGCCA 
TAATAGTACA 
GAACCTTACA 
CATAAATATG 
AACATCAAGC 
TAAATTTATA 



ATTTTTTTAA 
AGATATCTCT 
TTTTACTTTA 
GCAAAGTGTG 
ATCACTTATG 
AATGGGATAA 
CCAAAGACGT 
CCAAAAAATT 
ATGGAAATCA 
GAAGCATGGC 
ATTCCTGCGT 
TAGAAAACAC 
AAACTGTATG 
AAACGGATGG 
TTTTTGTGTC 
ATAAAAGCCA 
A 



TAATTGCAAA 
ACTGTTGCAT 
CGATGCATCC 
CAACGCAAAT 
GCATTTGATG 
AACCCCCAAA 
GGATGCAATT 
AGATTAAATA 
AGACTTCTCT 
TCGAAAGTAG 
AAAATTATTA 
CATAGAGAAC 
GGAAAACT GG 
TTTGAAGGGT 
CGCACTTACA 
AGAAAAATGC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
831 



25 



2) INFORMATION FOR SEQ ID NO: 1708 



30 



35 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: HH22 

(C) ACCESSION NUMBER: M60253 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1708 



45 TTGAAAAAGT 
TAATTCAAAC 
ATAATGCTCA 
GTAAAATTTA 
GATAAATAGT 

50 AAAAAGTACA 
GAAAAATATG 
AATGACATAT 
GTGGAATCAA 



TAATATTTTT 
AGTTCACATG 
TATTGGTGTT 
ATTCAGATAA 
GCTATTTTGT 
TATTAACAAA 
TAGGAAAAGA 
AGTGATAATA 
AAAAGTTAAA 



AATTGTAATT 
CCAAAGAGTT 
TATGCTTTAG 
GAGATTTGCC 
TAGAACAAGT 
GAT GAT AT AG 
TATCACTTTA 
CAGCAAACAA 
CAACGTCTAA 



GCTTTAGTTT 
AAATGATTTA 
ATACTAAAAG 
TATGCTTCAA 
ACCTTATAAT 
TTGCTTATTC 
AAAGCACTTA 
TAAAATTATA 
AAGAACTAGG 



TAAGTGCATG 
GAAAAAAAAT 
TGGTAAGGAA 
CTTCAAAAGC 
AAGTTAAATA 
TCCTATTTTA 
TTGAGGCTTC 
AAAGAAATCG 
AGATAAAGTA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
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ACAAATCCAG 
CAAAAAAGAT 
TTATCGCAAA 
TTAATGTTAA 
AAAAGACTAT 
CTAGAAATGA 
GTTTTAGTCA 
TAAGTTGATA 



TTAGATATGA 
ACTTCAACAC 
TGGAAAATTA 
ATAATAAAAG 
AAGGTTGCTG 
TGTTGCTTTT 
TTTTTACGAA 
AGTGAAACCG 



GATAGAATTA 
CTGCTGCTTT 
AGCAAAGAAA 
CGGAGATACT 
ATAAAAGTGG 
GTTTATCCTA 
TAAAGACAAT 
CCAAGAGTGT 



AATTACTATT 
CGGTAAGACT 
ACAAAAAATT 
TTAATTAAAG 
TCAAGCAATA 
AGGGCCAATC 
AAAAGTGATA 
AATGAAGGAA 



CACCAAAGAG 
TTAAATAAAC 
CTTACTTGAT 
ACGGTGTTCC 
ACATATGCTT 
TGAACCTATT 
AGCCAAATGA 
TTTTAA 



500 
550 
600 
650 
700 
750 
800 
846 



10 



15 



20 



25 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1709 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 555 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 
(C) ACCESSION NUMBER: M29695 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1709 



ATGTCCGCGA 
CCTGCCGATG 
GGGGTGGCGA 
CTGCCCAGAG 
GGGCGAGGAA 
GTGATGGCTG 
CAGTCTCTGG 
TGTCCGCGCT 
TTCAGACCGA 
AAGGCAGGAT 
GGTTTACATG 
CCTAA 



GCACCCCCCC 


CATAACTCTT 


CGCCTCATGA 


CCGAGCGCGA 


50 


CTCCATGACT 


GGCTCAACCG 


GCCGCACATC 


GTTGAGTGGT 


100 


CGAAGAGCGA 


CCGACTCTTG 


ATGAAGTGCT 


GGAACACTAC 


150 


CGATGGCGGA 


AGAGTCCGTA 


ACACCGTACA 


TCGCAATGCT 


200 


CCGATCGGCT 


ATGCTCAGTC 


GTACGTCGCG 


CTCGGAAGCG 


250 


GTGGGAAGAT 


GAAACTGATC 


CAGGAGTGCG 


AGGAATAGAC 


300 


CTGACCCGAC 


ACAGTTGAAC 


AAAGGCCTAG 


GAACAAGGCT 


350 


CTCGTTGAAC 


TACTGTTCTC 


GGACCCCACC 


GTGACGAAGA 


400 


CCCGACTCCG 


AACAACCATC 


GAGCCATACG 


CTGCTATGAG 


450 


TCGTGCGGGA 


GAAGATCATC 


ACCACGCCTG 


ACGGGCCGGC 


500 


GTTCAAACAC 


GACAAGCCTT 


CGAGAGAAAG 


CGCGGTGTTG 


550 
555 



40 



2) INFORMATION FOR SEQ ID NO: 1710 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 bases ' 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: K02987 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1710 

5 





ATGAACCAGA 


AAAACCCTAA 


AGACACGCAA 


AATTTTATTA 


CTTCTAAAAA 


50 




PPATHTAAAA 


ftAAATATTCiA 


ATPAPAPPAA 


TATPAGTAAA 
x xi x V-**J v_x x ruin 


PAA^APAAPft 


100 

A. \J \J 




TAAT APAAAT 


P^HATP Af^C^A 


A A ACC AP ATT 


TTAPPAAAftA 


HPTAC^TP AAA 


1 SO 




J. ono X on 1 


pac^ttapt^p 


TATAflAAATT 
X ix X AOAnn X x 


PATftPA^HPT 

On x o o ^ X 


TATGTPAAHT 
X r\, X X v^nftO X 




10 










A J. ILAAACLjCj 






ATATTCTAAA 


ATTTTCCTTC 


CCAAAACATA 


TAAACTATAA 


GATATATGGT 


300 




AATATTCCTT 


ATAACATCAG 


TACGGATATT 


GTCAAAAGAA 


TTACCTTTGA 


350 




AAGTCAGGCT 


AAATATAGCT 


ATCTTATCGT 


TGAGAAGGGA 


TTTGCGAAAA 


400 




GATTGCAAAA 


TCTGCAACGA 


GCTTTGGGTT 


TACTATTAAT 


GGTGGAGATG 


450 


15 


GATATAAAAA 


TGCTCAAAAA 


AGTACCACCA 


CTATATTTTC 


ATCCTAAGCC 


500 




AAGTGTAGAC 


TCTGTATTGA 


TTGTTCTTGA 


ACGACATCAA 


CCATTGATTT 


550 




CAAAGAAGGA 


CTACAAAAAG 


TATCGATCTT 


TTGTTTATAA 


GTGGGTAAAC 


600 




CGTGAATATC 


GTGTTCTTTT 


CACTAAAAAC 


CAATTCCGAC 


AGGCTTTGAA 


650 




GCATGCAAAT 


GTCACTAATA 


TTAATAAACT 


ATCGAAGGAA 


CAATTTCTTT 


700 


20 


CTATTTTCAA 


TAGTTACAAA 


TTGTTTCACT 


AA 




732 



25 



30 



2) INFORMATION FOR SEQ ID NO: 1711 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 738 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



35 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: BM2570 

(C) ACCESSION NUMBER: M19270 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1711 



40 



45 



50 



ATGAACAAAA 
ACTCAACCAA 
ACGAAATTGG 
AGTAAACAGG 
GTCAGAAAAA 
TTCTACAGTT 
ATTCCTTACC 
CCATGCGTCT 
CCTTGGATAT 
ATTCAGCAAT 
AGTAAACAGT 
ATAAATATTG 
GAATATCGTC 



ATATAAAATA 
ATAATAAAAC 
AACAGGTAAA 
TAACGTCTAT 
TTAAAATCGA 
TCAATTCCCT 
ATTTAAGCAC 
GACATCTATC 
TCACCGAACA 
TGCTTAAGCT 
GTCTTAATAA 
GAAGCTATAT 
AACTGTTTAC 



TTCTCAAAAC 
AATTGAATTT 
GGGCATTTAA 
TGAATTAGAC 
ATACTCGTGT 
AACAAACAGA 
ACAAATTATT 
TGATTGTTGA 
CTAGGGTTGC 
GCCAGCGGAA 
AACTTACCCG 
ACGTACTTTG 
TAAAAATCAG 



TTTTTAACGA 
AAAAGAAACC 
CGACGAAACT 
AGTCATCTAT 
CACTTTAATT 
GGTATAAAAT 
AAAAAAGTGG 
AGAAGGATTC 
TCTTGCACAC 
TGCTTTCATC 
CCATACCACA 
TTTCAAAATG 
TTTCATCAAG 



GTGAAAAAGT 
GATACCGTTT 
GGCTAAAATA 
TCAACTTATC 
CACCAAGATA 
TGTTGGGAAT 
TTTTTGAAAG 
TACAAGCGTA 
TCAAGTCTCG 
CTAAACCAAG 
GATGTTCCAG 
GGTCAATCGA 
CAATGAAACA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
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CGCCAAAGTA AACAATTTAA GTACCGTTAC TTATGAGCAA GTATTGTCTA 700 
TTTTTAATAG TTATCTATTA TTTAACGGGA GGAAATAA 738 



2) INFORMATION FOR SEQ ID NO: 1712 



10 



15 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 735 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: RN451 

(C) ACCESSION NUMBER: M17990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1712 



ATGAACGAGA 
TAATATAGAT 

25 TCTTTGAAAT 
AGGTGTAATT 
TACAGAAAAT 
ATATATTGCA 
AATATACCTT 

3 0 TAGTATAGCT 
GATTATTAAA 
GATATTTCTA 
TAGAGTGAAT 
CACACAAAGA 

3 5 AAAGAAT ACA 
ACATGCAGGA 
CTCTTTTCAA 



AAAATATAAA 
AAAATAATGA 
CGGCTCAGGA 
TCGTAACTGC 
AAACTTGTTG 
GTTTAAATTT 
ATAACATAAG 
GATGAGATTT 
TACAAAACGC 
TATTAAGTAT 
AGCTCACTTA 
TAAACAGAAG 
AGAAAATATT 
ATTGACGATT 
TAGCTATAAA 



ACACAGTCAA 
CAAATATAAG 
AAAGGGCATT 
CATTGAAATA 
ATCACGATAA 
CCTAAAAACC 
TACGGATATA 
ATTTAATCGT 
TCATTGGCAT 
GGTTCCAAGA 
TCAGATTAAA 
TATAATTATT 
TACAAAAAAT 
TAAACAATAT 
TTATTTAATA 



AACTTTATTA 
ATTAAATGAA 
TTACCCTTGA 
GACCATAAAT 
TTTCCAAGTT 
AATCCTATAA 
ATACGCAAAA 
GGAATACGGG 
TATTTTTAAT 
GAATATTTTC 
TAGAAAAAAA 
TCGTTATGAA 
CAATTTAACA 
TAGCTTTGAA 
AGTAA 



CTTCAAAACA 
CATGATAATA 
ATTAGTACAG 
TATGCAAAAC 
TTAAACAAGG 
AATATTTGGT 
TTGTTTTTGA 
TTTGCTAAAA 
GGCAGAAGTT 
ATCCTAAACC 
TCAAGAATAT 
ATGGGTTAAC 
ATTCCTTAAA 
CAATTCTTAT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
735 



40 



50 



2) INFORMATION FOR SEQ * ID NO: 1713 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1029 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: V583 

888 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1713 



5 ATGAATAAAA 
TGATGTGTCG 
AAAAATTCGA 
CTATGCAAGA 
ATTCTCCCCG 

10 AATACGAAAC 
TGCGGGGAGG 
CTATGTAGGC 
TGGCCTACAT 
ATGATTGAAA 

15 CTTTGTGAAG 
ACAGTACGGA 
GGAAAAATCT 
GGTCATGGGA 
GGTTGAGCCA 

2 0 GGCTCAGAGA 
ACGAAATCGG 
GCAGAGGGCT 
GTTCTAAACG 
TCCACGCATG 

25 GCCTGATTAC 



TAAAAGTCGC 
GTAAAATCCG 
TCCGCACTAC 
AGCCATGTAC 
GATAGGAAAA 
TCGGCGTATT 
ATGGTGCGAT 
TGCGATATTC 
TCTTACAAAA 
AAGGTGACAA 
CCGGCACGGT 
AGAACTAAAC 
TAATTGAGCA 
AACGAGGATG 
CGGTATCTTC 
ATGCGATGAT 
GTGCAAGAAA 
TGCTCGTGTT 
AGGTCAATAC 
GCGGCTGCCG 
ATTGGCGATA 



AATTATCTTC 
CAATAGAAAT 
ATCGGAATTA 
GGAATGGGAA 
CGCATGGTCT 
GACGTGGCTT 
ACAGGGTCTG 
AAAGCTCCGC 
AATGCGGGCA 
ACCGGAGGCG 
CAGGTTCGTC 
GCTGCGATAG 
AGCGATTTCG 
ATTTGATTGT 
CGCATCCATC 
TATCGTTCCA 
CGGCAAAGAA 
GATCTTTTTT 
CCTGCCCGGT 
CAGGAATCAC 
GAGAGGTGA 



GGCGGTTGCT 
TGCTGCGAAC 
CAAAAAACGG 
GCCGATAGTC 
GCTTGTCATG 
TCCCGGTTTT 
TTTGAATTGT 
AGCTTGCATG 
TCGCCGTCCC 
AGGACGCTTA 
CTTTGGCGTA 
AAGCAGCAGG 
GGCTGTGAGG 
CGGCGAAGTG 
AGGAAAACGA 
GCAGACATTC 
AGTATATCGG 
TGCAGGAGGA 
TTTACATCGT 
GCTTCCCGCA 



CGGAGGAACA 
ATTAATACTG 
CGTATGGAAG 
TCCCCGCCAT 
AAAGAAAGAG 
GCATGGCAAA 
CTGGTATCCC 
GACAAATCAC 
CGAATTTCAA 
CCTACCCTGT 
ACCAAAGTAA 
ACAATATGAT 
TCGGCTGCGC 
GATCAAATCC 
GCCGGAAAAA 
CGGTCGAGGA 
GTGCTTGGAT 
TGGCGGCATC 
ACAGCCGCTA 
CTAATTGACA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
1000 
1029 



2) INFORMATION FOR SEQ ID NO: 1714 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
35 ( D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
40 (A) ORGANISM: Campylobacter jejuni subsp. jejuni 

(B) STRAIN : ATCC 33292 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1714 

45 GCACAGGTTC AGTTGGGCTT GATCTTGCTT TAGGTATAGG CGGTGTTCCA 50 

AAAGGAAGAA TTATAGAAAT TTATGGRCCT GAAAGTTCAG GTAAAACCAC 100 

TCTAACTCTA CACATTATCG CAGAATGCCA AAAAGCAGGT GGRGTTTGTG 150 

CTTTTATCGA TGCAGAACAT GCRCTTGATG TRAAATATGC TAAAAATTTG 200 

GGTGTAAATA CAGATGATTT GTATGTTTCT CAGCCTGATT TTGGAGARCA 250 

50 AGCCTTAGAA ATTGTAGAAA CTATAGCAAG AAGTGGCGCA GTAGATCTTA 300 

TCGTAGTAGA TAGCGTTGCA GCGCTTACCC CAAAAGCAGA AATTGAAGGC 350 

GATATGGGTG ATCAACATGT AGGACTTCAA GCAAGACTTA TGTCTCAAGC 4 00 

TCTAAGAAAA CTTACAGGTA TAGTTCATAA AATGAATACC ACAGTAATTT 450 
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TYATCAACCA 
GAAACCACAA 
AGATGTTAGA 
ACCGCGTTAA 
CAAGCTGAAT 
ATTGATCGAT 
GGTTTTCTTA 
GCTTTCTTAA 



AATTCGTATG 
CAGGTGGAAA 
AAAGTAGCAA 
AGTAAAAGTA 
TTGATGTGAT 
TATGGTGTAA 
TAAAGATAAA 
AAGAAAAC 



AAAATCGGTG 
TGCATTAAAA 
CCTTAAAACA 
GTTAAAAATA 
GTTTGGAGAG 
AACTTGATAT 
AAACTTGGAC 



CTATGGGTTA 
TTTTATGCTT 
AAACGAAGAA 
AAGTTGCTCC 
GGTTTAAGCC 
CGTAGATAAA 
AAGGTAGAGA 



TGGTACTCCT 
CTGTGCGTTT 
CCTATAGGAA 
TCCATTCAGA 
GTGAAGGTGA 
AGTGGTGCGT 
AAATTCAAAA 



500 
550 
600 
650 
700 
750 
800 
818 



15 



20 



25 



30 



35 



40 



2) INFORMATION FOR SEQ ID NO: 1715 

. (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 809 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia adiacens 

(B) STRAIN: ATCC 49175 

(C) ACCESSION NUMBER: AF124224 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1715 



45 



TGGTGCTATC 
GTGAACACAT 
TTCTTAAACA 
AGAAATGGAA 
ACACTCCAGT 
TCATACRAAG 
TCCAACTCCA 
ACGTGTTCTC 
CGTGGACAAG 
AGAAACTTCA 
TAGACTACGC 
ACACGTGACA 
CACTCCACAT 
AAGGTGGACG 
TTCCGTACAA 
AATGGTAATG 
CAGTAGCGA 



TTAGTAGTAT 
CTTATTATCA 
AAGTTGACAT 
GTTCGTGACT 
TGTTGCAGGT 
AAAAAATCTT 
GAACGYGACG 
AATCACAGGT 
TTCGTGTTGG 
AAAACAACTG 
TGAAGCAGGG 
ACATCGAACG 
ACTAAATTCA 
TCATACTCCA 
C AG AC AT C AC 
CCTGGTGATA 



CTGCAGCTGA 
CGTCAAGTAG 
GGTTGACGAT 
TATTATCAGA 
TCTGCTTTAC 
AGAATTAATG 
TTGACAAACC 
CGTGGTACTG 
TGACGAAGTT 
TAACTGGTGT 
GATAACATTG 
TGGACAAGTT 
AAGCTGAAGT 
TTCTTCTCTA 
TGGTGTTTGT 
ACGTAACTAT 



TGGTCCAATG 
GTGTTCCTTA 
GAAGAATTAT 
ATACGATTTC 
GCGCTTTAGA 
GCTGCTGTTG 
ATTCATGATG 
TTGCTACAGG 
GAAATCGTTG 
TGAAATGTTC 
GTACATTATT 
CTTGCTAAAC 
TTACGTATTA 
ACTACCGTCC 
GTGTTACCAG 
GGAAGTTGAA 



CCTCAAACAC 
CATCGTTGTA 
TAGAATTAGT 
CCAGGCGATG 
AGGCGACGCT 
ACGAATACAT 
CCAGTTGAAG 
TCGTGTTGAA 
GTATTTCAGA 
CGTAAATTGT 
ACGTGGTGTT 
CAGGAACAAT 
ACTAAAGAAG 
TCAATTCTAC 
AAGGCGTTGA 
TTAATTCACC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
809 



2) INFORMATION FOR SEQ ID NO: 1716 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

890 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



10 



(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia defectiva 

(B) STRAIN: ATCC 49176 

(C) ACCESSION NUMBER: AF124225 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1716 



15 



20 



25 



CGGCGCGATC 
GTGAACACAT 
TTCTTGAACA 
TGAAATGGAA 
ACACTCCAGT 
AACTACGAAG 
TCCAGAACCA 
ACGTATTCTC 
CGTGGTCAAG 
AGAAACTTCT 
TGGATTACGC 
ACTCGTGACC 
CACTCCGYAC 
AAGGTGGTCG 
TTCCGTACAA 
AATGGTTATG 
CAATCGCGAT 



CTCGTTGTAT 
CCTCTTGTCT 
AAGTTGACAT 
GTTCGTGACC 
TATCGCTGGT 
CTAAAGTTTT 
GAACGTGACA 
TATCACTGGT 
TTCGCGTTGG 
AAGACTACCG 
TGAAGCTGGG 
AAATCCAACG 
ACTAAGTTCG 
TCACACTCCA 
CTGACGTAAC 
CCAGGCGACA 
CGAAGAA 



CTGCTGCTGA 
CGTCAAGTTG 
GGTTGACGAC 
TCTTGTCTGA 
TCAGCTTTGA 
AGAATTGATG 
CTGACAAGCC 
CGTGGTACTG 
TGACGAAGTT 
TTACCGGTGT 
GACAACGTTG 
TGGTCAAGTA 
AAGCTGAAGT 
TTCTTCTCTA 
TGGTGTTGTT 
ACGTACAAAT 



CGGCCCAATG 
GTGTTCCTTA 
GAAGAATTGC 
ATACGACTTC 
AAGCTTTAGA 
GAACAAGTTG 
ATTCATGATG 
TTGCAACTGG 
GAAATCGTTG 
TGAAATGTTC 
GTACCTTGTT 
TTATCTAAAC 
GTACGTATTG 
ACTACCGTCC 
ACTTTACCAG 
GGTTGTTGAA 



CCACAAACTC 
CATCGTAGTA 
TCGAATTAGT 
CCAGGCGACG 
AGGCGACGCT 
ATGCTTACAT 
CCAGTCGAAG 
TCGTGTTGAA 
GTATCGAAGA 
CGTAAGTTAT 
ACGTGGTGTA 
CAGGTTCAAT 
TCTAAAGAAG 
ACAATTCTAC 
AAGGTACTGA 
TTGATCCACC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
817 



30 



35 



40 



2) INFORMATION FOR SEQ ID NO: 1717 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS:. Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



45 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium accolens 

(B) STRAIN: ATCC 49725 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1717 



CGGCGCTATC 
50 GCGAGCACGT 
GCACTGAACA 
GGAGATGGAG 
CTCCTATCGT 



CTGGTTGTTG 
TCTGCTTGCT 
AGTGCGACAT 
ATCTCCGAGC 
TCACATCTCC 



CTGCAACCGA 
CGCCAGGTTG 
GGTTGATGAT 
TGCTCGCAGA 
GCTCTGAAGG 



TGGCCCGATG 


CCGCAGACCC 


50 


GCGTTCCTTA 


CATCCTCGTT 


100 


GAGGAAATCA 


TCGAGCTCGT 


150 


GCAGGACTAC 


GATGAGGAAG 


200 


CACTCGAGGG 


TGACGAGAAG 


250 
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TGGGTACAGT 
TGATCCGGAG 
TCTTCACCAT 
GGTCGTCTGA 
GTCCCAGAAC 
ACTACACCGA 
CGTGAGGACG 
CCCTCACACC 
GCGGCCGCCA 
CGCACCACCG 
GGTTATGCCT 
TTGCTATGGA 



CCATCGTTGA 
CGCGCTACCG 
TACCGGCCGC 
ACGTCAACGA 
ACCACCGTTA 
GGCTGGCGAC 
TTGAGCGTGG 
AAGTTCGAGG 
CACCCCGYTC 
ACGTTACCGG 
GGCGACAACG 
CGAG 



CCTGATGGAT 
ATCAGCCGTT 
GGTACCGTTG 
GGACGTTGAG 
CCGGTATCGA 
AACTGTGGTC 
CCAGGTTGTT 
GTTCCGTCTA 
ATGAACAACT 
TGTTGTGAAC 
TTGAGATGTC 



GCCTGCGACA 
CTTGATGCCT 
TTACCGGCCG 
ATCATCGGTA 
GATGTTCCGC 
TGCTTCTGCG 
ATCAAGCCGG 
CGTCCTGAAG 
ACCGTCCTCA 
CTGCCTGAGG 
TGTTGAGCTC 



ACTCCATCCC 
ATCGAGGACA 
TGTTGAGCGT 
TCCAGGAGAA 
AAGATGATGG 
TGGTACCAAG 
GCGCTTACAC 
AAGGAAGAGG 
GTTCTACTTC 
GCACCGAGAT 
ATCCAGCCTG 



300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
814 
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15 
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2) INFORMATION FOR SEQ ID NO: 1718 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium genitalium 

(B) STRAIN: ATCC 33031 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1718 



CGGCGCCATC 


CTGGTTGTTG 


CTGCAACCGA 


TGGCCCGATG 


CCGCAGACCC 


50 


GTGAGCACGT 


TCTGCTGGCT 


CGCCAGGTTG 


GCGTTCCGTA 


CATCCTAGTT 


100 


GCACTGAACA 


AGTGCGACAT 


GGTTGATGAT 


GAGGAGCTGC 


TGGAGCTCGT 


150 


CGAGATGGAG 


GTCCGCGAGC 


TGCTGGCTGA 


GCAGGACTTC 


GACGAGGAAG 


200 


CACCTGTTGT 


TCACATCTCC 


GCACTGAAGG 


CCCTGGAGGG 


CGACGAGAAG 


250 


TGGGCTAAGC 


AGATCCTGGA 


GCTCATGGAG 


GCTTGCGACA 


ACTCCATCCC 


300 


GGATCCGGAG 


CGCGAGACCG 


ACAAGCCGTT 


CCTGATGCCG 


GTTGRGGACA 


350 


TCTTCACCAT 


TACCGGCCGC 


GGTACCGTTG 


TTACCGGCCG 


TGTTGAGCGT 


400 


GGCGTCCTGA 


ACCTGAACGA 


CGAGGTCGAG 


ATCCTGGGCA 


TCCGCGAGAA 


450 


GTCCACCAAG 


ACCACCGTTA 


CCTCCATCGA 


GATGTTCAAC 


AAGCTGCTGG 


500 


ACACCGCAGA 


GGCTGGCGAC 


AACGCCGCAC 


TGCTGCTGCG 


TGGCCTGAAG 


550 


CGCGAAGATG 


TTGAGCGTGG 


TCAGATCGTT 


GCTAAGCCGG 


GCGAGTACAC 


600 


CCCGCACACC 


GAGTTCGAGG 


GCTCCGTCTA 


CGTTCTGTCC 


AAGGACGAGG 


650 


GTGGCCGCCA 


CACCCCGTTC 


TTCGACAACT 


ACCGTCCGCA 


GTTCTATTTC 


700 


CGCACCACCG 


ACGTTACCGG 


TGTTGTGAAG 


CTGCCGGAGG 


GCACCGAGAT 


750 


GGTTATGCCG 


GGCGACAACG 


TTGACATGTC 


CGTCACCCTG 


ATCCAGCCGG 


800 


TTGCTATGGA 


CGAG 








814 



2) INFORMATION FOR SEQ ID NO: 1719 



892 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium jeikeium 

(B) STRAIN: ATCC 43216 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1719 



15 CGGCGCCATC 
GCGAGCACGT 
GCACTGAACA 
CGAGATGGAG 
CTCCGGTTGT 

20 TGGGCTAACC 
GGATCCGGAG 
TCTTCACCAT 
GGCATCCTGA 
GTCCCAGAAG 

25 ACACCGCAGA 
CGCGAGGACG 
CCCGCACACC 
GCGGCCGCCA 
CGCACCACCG 

30 GGTTATGCCG 
TTGCTATGGA 



CTGGTTGTTG 
TCTGCTGGCY 
AGTGTGACAT 
GTCCGCGAGC 
TCACATCTCC 
AGATTCTCGA 
CGCGAGACCG 
TACCGGTCGC 
ACCTGAACGA 
ACCACCGTTA 
GGCTGGCRAC 
TTGAGCGTGG 
GAGTTCGAGG 
CACCCCGTTC 
ACGTTACCGG 
GGCGACAACG 
CGAG 



CCGCAACCGA 
CGCCAGGTTG 
GGTTGACGAT 
TGCTGGCTGA 
GCACTGAAGG 
GCTGATGCAG 
ACAAGCCGTT 
GGTACCGTTG 
CGAGGTTGAG 
CCTCCATCGA 
AACGCTGCAC 
CCAGATCATC 
GCTCCGTCTA 
TTCGACAACT 
TGTTGTGAAG 
TYGACATGTC 



TGGCCCGATG 
GCGTTCCGTA 
GAGGAGCTGC 
GCAGGACTTC 
CCCTGGAGGG 
GCTTGCGACG 
CCTGATGCCG 
TTACCGGCCG 
ATCCTGGGTA 
GATGTTCAAC 
TGCTGCTGCG 
GCTAAGCCGG 
CGTTCTGTCC 
ACCGTCCGCA 
CTGCCTGAGG 
CGTCACCCTG 



CCGCAGACCC 
CATCCTGGTT 
TGGAGCTCGT 
GACGAGGAAG 
CGACGAGAAG 
AGTCTATCCC 
GTTGWGGACA 
TGTTGAGCGT 
TCCGCGAGAA 
AAGCTGCTGG 
TGGTCTGAAG 
GCGAGTACAC 
AAGGACGAGG 
GTTCTACTTC 
GCACCGAGAT 
ATCCAGCCGG 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
814 



35 2) INFORMATION FOR SEQ ID NO: 1720 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 748 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium pseudodiphtheriticum 

(B) STRAIN: ATCC 10700 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1720 

CGGCGCTATC TTGGTTGTTG CAGCTACCGA CGGCCCAATG CCACAGACTC 50 
GCGAGCACGT TCTGCTGGCT CGCCAGGTTG GCGTTCCTTA CATCCTGGTT 100 
GCACTAAACA AGTGCGACAT GGTTGACGAC GAGGAAATCC TCGAGCTCGT 150 

893 
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CGAGATGGAG 
CTCCAATCGT 
TGGGTTAACG 
TGATCCAGAC 
TCTTCACCAT 
GGTTCCCTGA 
GTCCCAGAAG 
ACTACACCGA 
CGTGAAGACG 
CACCCACAAG 
GCGGCCGCCA 
CGCACCACCG 



ATCCGCGAAT 
TCACATCTCC 
CCATCGTTGA 
CGTGCTACCG 
TACCGGTCGT 
AGGTCAACGA 
ACCACCATCA 
GGCCGGCGAC 
TTGAGCGTGG 
AAGTTCGAAG 
CACCCCGTTC 
ACGTTACCGG 



TGCTGGCTGA 
GCAGTCGGCG 
ACTGATGGAT 
ACAAGCCATT 
GGCACCGTTG 
AGAAGTCGAG 
CCGGTATCGA 
AACGCTGGTC 
ACAGGTTATC 
GTTCCGTCTA 
TTCGACAACT 
TGTTGTTACC 



CCAGGAATTC 
CCTTGGAAGG 
GCTTGTGACG 
CCTGATGCCT 
TTACGGGTCG 
ATCATCGGCA 
AATGTTCCGC 
TGCTGCTTCG 
GTTGCTCCAG 
CGTTCTTTCC 
ACCGTCCTCA 
CTGCCTGAGG 



GACGAAGAAG 
CGAAGAGAGG 
AGTCGATCCC 
ATCGAGGACA 
TGTTGAGCGT 
TCAAGGAAAA 
AAGATGCTGG 
CGGTACCAAG 
GTGCTTACAG 
AAGGACGAGG 
GTTCTACTTC 
GCACCGAG 



200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
748 
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20 



25 



30 
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2) INFORMATION FOR SEQ ID NO: 1721 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium striatum 

(B) STRAIN: ATCC 6940 

-r J 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1721 



GGCGCTATCT 


TGGTTGTTGC 


TGCAACCGAT 


GGCCCGRTGC 


CGCAGACCCG 


50 


CGAGCACGTT 


CTTCTGGCTC 


GCCAGGTTGG 


CGTTCCTTAC 


ATCCTCGTTG 


100 


CACTGAACAA 


GTGCGACATG 


GTTGACGACG 


AGGAAATTAT 


CGAGCTCGTC 


150 


GAGATGGAGA 


TCCGCGAACT 


GCTCGCAGAG 


CAGGACTACG 


ATGAGGAAGC 


200 


TCCGATCGTT 


CACATCTCTG 


CTCTGAAGGC 


TCTTGAGGGC 


GRCGAGAAGT 


250 


GGGTACAGGC 


TATCGTTGAC 


CTGATGCAGG 


CTTGCGATGA 


CTCCATCCCG 


300 


GATCCGGAGC 


GCGAGCTGGA 


CAAGCCGTTC 


CTGATGCCAA 


TCGAGGACAT 


350 


CTTCACCATC 


ACCGGCCGCG 


GTACCGTTGT 


TACTGGCCGT 


GTTGAGCGTG 


400 


GCTCCCTGAA 


CGTCAACGAG 


GACGTTGAGA 


TCATCGGTAT 


CCAGGACARG 


450 


TCCATCTCCA 


CCACCGTTAC 


CGGTATCGAG 


ATGYTCCGCA 


AGATGATGGA 


500 


CTACACCGAG 


GCTGGCGACA 


ACTGTGGTCT 


GCTTCTGCGT 


GGTACCAAGC 


550 


GTGAAGAGGT 


TGAGCGCGGC 


CAGGTTGTTA 


TTAAGCCGGG 


CGCTTACACC 


600 


CCTCACACCC 


AGTTCGAGGG 


TTCCGTCTAC 


GTCCTGAAGA 


AGGAAGAGGG 


650 


CGGCCGCCAC 


ACCCCGTTCA 


TGGACAACTA 


CCGTCCGCAG 


TTCTACTTCC 


700 


GCACCACCGA 


CGTTACCGGC 


GTCATCAAGC 


TGCCTGAGGG 


CACCGAGATG 


750 


GTTATGCCTG 


GCGACAACGT 


CGAGATGTCY 


GTCGAGCTGA 


TCCAGCCGGT 


800 


CGCTATGGAC 


GAG 








813 



2) INFORMATION FOR SEQ ID NO: 1722 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus avium 

(B) STRAIN: ATCC 14025 

(C) ACCESSION NUMBER: AF124220 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1722 



15 



20 



25 



30 



CGGAGCTATC 
GTGAACACAT 
TTCTTAAACA 
TGAAATGGAA 
ACACTCCAGT 
TCATACGAAG 
CCCAACACCA 
ACGTATTCTC 
CGTGGACAAG 
CGAAACTGCT 
TAGACTACGC 
GCACGTGAAG 
CACTCCACAT 
AAGGTGGACG 
TTCCGTACAA 
AATGGTWATG 
CAATYGCGGT 



TTAGTAGTAT 
CTTGTTATCT 
AAATGGATAT 
GTTCGTGACT 
TATCGCAGGT 
AAAAAATCTT 
GTTCGTGATA 
AATCACTGGT 
TTCGCGTTGG 
AAAACAACTG 
TGAAGCAGGT 
ATATCCAACG 
ACAAAATTCT 
TCATACTCCA 
CTGACGTAAC 
CCTGGGGATA 
AGAAGAC 



CTGCTGCTGA 
CGTAACGTTG 
GGTTGACGAT 
TATTAACTGA 
TCAGCGTTGA 
AGAATTAATG 
CTGACAAACC 
CGTGGTACTG 
TGACGAAGTT 
TTACAGGTGT 
GACAACATCG 
TGGACAAGTA 
CTGCAGAAGT 
TTCTTCACTA 
TGGTGTAGTT 
ACGTAACTAT 



TGGCCCTATG 
GTGTTCCTTA 
GAAGAATTAC 
ATACGACTTC 
AAGCTTTAGA 
GCTGCTGTTG 
ATTCATGATG 
TTGCAACTGG 
GAAATCGTAG 
TGAAATGTTC 
GTGCTTTGTT 

, TTGGCTAAAC 
TTATGTTCTA 

..ACTACCGTCC 
GATCTACCAG 
GGAAGTTGAA 



CCTCAAACTC 
CATCGTTGTA 
TTGAATTAGT 
CCAGGCGACG 
AGGCGACGCT 
ACGAATATAT 
CCAGTCGAAG 
TCGTGTTGAA 
GTATCGCTGA 
CGTAAATTGT 
ACGTGGTGTT 
CAGCTTCAAT 
ACTAAAGAAG 
TCAGTTCTAC 
AAGGTACTGA 
TTGATYCACC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
817 



35 



45 



2) INFORMATION FOR SEQ ID NO: 1723 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 823 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Gardnerella vaginalis 

(B) STRAIN: ATCC 14018 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1723 

TGGCGCAATC CTCGTGGTTG CTGCTACCGA CGGTCCAATG GCTCAGACCC 50 
GTGAACACGT CTTGCTTGCT AAGCAGGTCG GCGTTCCAAA AATTCTTGTT 100 

895 
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GCTTTGAACA 


AGTGCGATAT 


GGTTGACGAC 


GAAGAGCTTA 


TCGATCTCGT 


150 




TGAAGAAGAG 


GTCCGTGACC 


TCCTCGAAGA 


AAACGGCTTC 


GATCGCGATT 


200 




GCCCAGTCYT 


CCGTACTTCC 


GCTTACGGCG 


CTTTGCATGA 


TGACGCTCCA 


250 




GACCACGACA 


AGTGGGTAGA 


GACCGTCAAG 


GAACTCATGA 


AGGCTGTTGA 


300 


5 


CGAGTACATC 


CCAACCCCAA 


CTCACGATCT 


TGACAAGCCA 


TTCTTGATGC 


350 




CAATCGAAGA 


TGTGTTCACC 


ATCTCCGGTC 


GTGGTYCCGT 


TGTCACCGGT 


400 




CGTGTTGAGC 


GTGGTAAGCT 


CCCAATCAAC 


ACCCCAGTTG 


AGATCGTTGG 


450 




TTTGCGCGAT 


ACCCAGACCA 


CCACCGTCAC 


CTCTATCGAG 


ACCTTCCACA 


500 




AGCAGATGGA 


TGAGGCAGAG 


GCTGGCGATA 


ACACTGGTCT 


TCTTCTCCGC 


550 


10 


GGTATCAACC 


GTACCGACGT 


TGAGCGTGGT 


CAGGTTGTGG 


CTGCTCCAGG 


600 




TTCTGTGACT 


CCACACACCA 


AGTTCGAAGG 


CGAAGTTTAC 


GTCTTGACCA 


650 




AGGACGAAGG 


TGGCCGTCAC 


TCGCCATTCT 


TCTCCAACTA 


CCGTCCACAG 


700 




TTPTAPTTfr 


CZTACCACCGA 




GTTATCACTT 




7 SO 




CATCGAAATG 


GTTCAGCCAG 


GCGATCACGC 


AACCTTCACT 


GTTGAGTTGA 


800 


15 


TCCAGGCTAT 


CGCAATGGAA 


GAG 






823 




2) INFORMATION FOR SEQ ID NO: 1724 








20 


(i) SEQUENCE CHARACTERISTICS: 










(A) 


LENGTH: 817 


bases 










(B) 


TYPE: Nucleic acid 










(C) 


STRANDEDNESS : Double 








25 


(D) 


TOPOLOGY: Linear 










(ii) MOLECULE TYPE: Genomic DNA 










(vi) ORIGINAL SOURCE: 












(A) 


ORGANISM: Listeria innocua 








(B) 


STRAIN: ATCC 33090 










(xi) SEQUENCE DESCRIPTION: SEQ ID 


NO: 1724 






35 


CGGAGCTATC 


TTAGTAGTAT 


CTGCTGCTGA 


TGGCCCAATG 


CCACAAACTC 


50 




GTGAACATAT 


CTTACTTTCA 


CGTCAAGTTG 


GTGTTCCATA 


CATCGTTGTA 


100 




TTCATGAACA 


AATGTGACAT 


GGTTGACGAT 


GAAGAATTAC 


TAGAATTAGT 


150 




TGAAATGGAA 


ATTCGTGATC 


TATTAACTGA 


ATATGAATTC 


CCTGGCGATG 


200 




ACATTCCTGT 


AATCAAAGGT 


TCAGCTCTTA 


AAGCACTTCA 


AGGTGAAGCT 


250 


40 


GACTGGGAAG 


CTAAAATTGA 


CGAGTTAATG 


GAAGCTGTAG 


ATTCTTACAT 


300 




TCCAACTCCA 


GAACGTGATA 


CTGACAAACC 


ATTCATGATG 


CCAGTTGAGG 


350 




ATGTATTCTC 


AATCACTGGT 


CGTGGAACAG 


TTGCAACTGG 


ACGTGTTGAA 


400 




CGTGGACAAG 


TTAAAGTTGG 


TGACGAAGTA 


GAAGTTATCG 


GTATTGAAGA 


450 




AGAAAGCAAA 


AAAGTAGTAG 


TAACTGGAGT 


AGAAATGTTC 


CGTAAATTAC 


500 


45 


TAGACTACGC 


TGAAGCTGGC 


GACAACATTG 


GCGCACTTCT 


ACGTGGTGTT 


550 




GCTCGTGAAG 


ATATCCAACG 


TGGTCAAGTA 


TTAGCTAAAC 


CAGGTTCGAT 


600 




TACTCCACAC 


ACTAACTTCA 


AAGCTGAAAC 


TTATGTTTTA 


ACTAAAGAAG 


650 




AAGGTGGACG 


TCACACTCCA 


TTCTTCAACA 


ACTACCGCCC 


ACAATTCTAT 


700 




TTCCGTACTA 


CTGACGTAAC 


TGGTATTGTT 


ACACTTCCAG 


AAGGTACTGA 


750 


50 


AATGGTAATG 


CCTGGTGATA 


ACATTGAGCT 


TGCAGTTGAA 


CTAATTGCAC 


800 




CAATCGCTAT 


CGAAGAC 








817 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria ivanovii 

(B) STRAIN: ATCC 19119 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1725 



CGGAGCTATC 
GTGAACATAT 
ATTCATGAAC 

2 0 TTGAAATGGA 

GACATTCCTG 
TGATTGGGAA 
TTCCAACTCC 
GATGTATTCT 
25 ACGTGGACAA 
AAGAAAGCAA 
CTAGACTACG 
TGCTCGTGAA 
TTACTCCACA 

3 0 GAAGGTGGAC 

TTTCCGTACT 
AAATGGTAAT 
CCAATCGCTA 



TTAGTAGTAT 
TCTTACTTTC 
AAATGTGACA 
AATTCGTGAT 
TAATCAAAGG 
GCTAAAATTG 
AGAACGTGAT 
CAATCACTGG 
GTTAAAGTTG 
AAAAGTAGTA 
CTGAAGCTGG 
GATATCCAAC 
TACTAACTTC 
GTCATACTCC 
ACTGACGTAA 
GCCTGGTGAT 
TCGAAGAC 



CTGCTGCTGA 
ACGTCAAGTT 
TGGTTGACGA 
CTATTAACTG 
TTCAGCTCTT 
ACGAGTTAAT 
ACTGACAAAC 
TCGTGGAACA 
GTGACGAAGT 
GTAACTGGAG 
CGACAACATT 
GTGGTCAAGT 
AAAGCTGAAA 
ATTCTTCAAC 
CTGGTATTGT 
AACATTGAGC 



TGGTCCAATG 
GGTGTTCCAT 
TGAAGAATTA 
AATATGAATT 
AAAGCACTTC 
GGAAGCTGTA 
CATTCATGAT 
GTTGCAACTG 
AGAAGTTATC 
TAGAAATGTT 
GGCGCACTTC 
ATTAGCTAAA 
CTTATGTTTT 
AACTACCGCC 
TACACTTCCA 
TTGCAGTTGA 



CCACAAACTC 
ACATCGTTGT 
CTTGAATTAG 
CCCTGGCGAC 
AAGGTGAAGC 
GATTCTTACA 
GCCAGTTGAG 
GACGTGTTGA 
GGTATTGAAG 
CCGTAAATTA 
TACGTGGTGT 
CCAGGTTCGA 
AACTAAAGAA 
CACAATTCTA 
GAAGGTACTG 
ACTAATTGCA 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
818 



35 



2) INFORMATION FOR SEQ ID NO: 1726 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



50 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

(B) STRAIN: LSPQ 5093202 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1726 



CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC 



50 
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15 



GTGAACATAT 
TTCATGAACA 
TGAAATGGAA 
ACATTCCTGT 
GACTGGGAAG 
TCCAACTCCW 
ATGTATTCTC 
CGTGGACAAG 
AGAAAGCAAA 
TAGACTACGC 
GCTCGTGAAG 
TACTCCACAC 
AAGGTGGACG 
TTCCGTACTA 
AATGGTAAYG 
CAATCGCTAT 



CTTACTTTCA 
AATGTGACAT 
ATTCGTGATC 
AATCAAAGGT 
CTAAAATTGA 
GAACGTGATA 
AATCACTGGT 
TTAAAGTTGG 
AAAGTAGTAG 
TGAAGCTGGC 
ATATCCAACR 
ACTAACTTCA 
TCACACTCCA 
CTGACGTAAC 
CCTGGTGATA 
CGAAGAC 



CGTCAAGTTG 
GGTTGACGAT 
TATTAACTGA 
TCAGCTCTTA 
CGAGTTAATG 
CTGACAAACC 
CGTGGAACAG 
TGACGAAGTA 
TAACTGGAGT 
GACAACATTG 
TGGTCAAGTA 
AAGCTGAAAC 
TTCTTCAACA 
TGGTATTGTT 
ACATTGAGCT 



GTGTTCCATA 
GAAGAATTAC 
ATATGAATTC 
AAGCACTTCA 
GAAGCTGTAG 
ATTCATGATG 
TTGCAACTGG 
GAAGTTATCG 
AGAAATGTTC 
GCGCACTTCT 
TTAGCTAAAC 
TTATGTTTTA 
ACTACCGCCC 
ACACTTCCAG 
TGCAGTTGAA 



CATCGTTGTA 
TAGAATTAGT 
CCTGGCGATG 
AGGTGAAGCT 
ATTCTTACAT 
CCAGTTGAGG 
ACGTGTTGAA 
GTATCGAAGA 
CGTAAATTAC 
ACGTGGTGTT 
CAGGTTCGAT 
ACTAAAGAAG 
ACAATTCTAT 
AAGGTACTGA 
CTAATTGCAC 



100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
817 



20 2) INFORMATION FOR SEQ ID NO: 1727 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



30 



35 



40 



45 



50 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria seeligeri 

(B) STRAIN: ATCC 35967 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1727 



CGGAGCTATC 
GTGAACATAT 
TTCATGAACA 
TGAAATGGAA 
ACATTCCTGT 
GACTGGGAAG 
TCCAACTCCA 
ATGTATTCTC 
CGTGGACAAG 
AGAAAGCAAA 
TAGACTACGC 
GCTCGTGAAG 
TACTCCACAT 
AAGGTGGACG 
TTCCGTACTA 
AATGGTAATG 
CAATCGCTAT 



TTAGTAGTAT 
CTTACTTTCA 
AATGTGACAT 
ATTCGTGATC 
AATCAAAGGT 
CTAAAATTGA 
GAACGTGATA 
AATCACTGGT 
TTAAAGTTGG 
AAAGTAATAG 
TGAAGCTGGC 
ATATCCAACG 
ACTAACTTCA 
TCACACTCCA 
CTGACGTAAC 
CCTGGTGATA 
CGAAGAC 



CTGCTGCTGA 
CGTCAAGTTG 
GGTTGACGAT 
TATTAACTGA 
TCAGCTCTTA 
CGAGTTAATG 
CTGACAAACC 
CGTGGAACTG 
TGACGAAGTA 
TAACTGGAGT 
GACAACATTG 
TGGTCAAGTA 
AAGCTGAAAC 
TTCTTCAACA 
TGGTATTGTT 
ACATTGAGCT 



TGGCCCAATG 
GTGTTCCATA 
GAAGAATTAC 
ATATGAATTC 
AAGCACTTCA 
GAAGCTGTAG 
ATTCATGATG 
TTGCAACTGG 
GAAGTTATCG 
AGAAATGTTC 
GCGCACTTCT 
TTAGCTAAAC 
TTATGTTTTA 
ACTACCGCCC 
ACACTTCCAG 
TGCAGTTGAA 



CCACAAACTC 
CATCGTTGTA 
TTGAATTAGT 
CCTGGTGATG 
AGGTGAAGCT 
ATTCTTACAT 
CCAGTTGAGG 
ACGTGTTGAA 
GTATTGAAGA 
CGTAAATTAC 
ACGTGGTGTT 
CAGGTTCGAT 
ACTAAAGAAG 
ACAATTCTAT 
AAGGTACTGA 
CTAATTGCAC 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: ATCC 25923 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1728 



CGGTGGTATC 
GTGAACACAT 
TTCTTAAACA 

2 0 AGAAATGGAA 

ATGTACCTGT 
CAATACGAAG 
TCCAACTCCA 
ACGTATTCTC 
25 CGTGGTCAAA 
CACATCTAAA 
ACTACGCTGA 
CGTGAAGACG 
ACCACATACT 

3 0 GTGGACGTCA 

CGTACTACTG 
GGTAATGCCT 
TCGCGATTGA 



TTAGTAGTAT 
TCTTTTATCA 
AAGTTGACAT 
GTTCGTGACT 
AATCGCTGGT 
AAAAAATCTT 
GAACGTGATT 
AATCACTGGT 
TCAAAGTTGG 
ACAACTGTTA 
AGCTGGTGAC 
TACAACGTGG 
GAATTCAAAG 
CACTCCATTC 
ACGTAACTGG 
GGTGATAACG 
AGAC 



CTGCTGCTGA 
CGTAACGTTG 
GGTTGACGAT 
TATTAAGCGA 
TCAGCATTAR 
AGAATTARTG 
CTGACAAACC 
CGTGGTACTG 
TGAAGAAGTT 
CAGGTGTTGA 
AACATTGGTG 
TCAAGTATTA 
CAGAAGTATA 
TTCTCAAACT 
TGTTGTTCAC 
TTGAAATGAC 



CGGTCCAATG 
GTGTACCAGC 
GAAGAATTAT 
ATATGACTTC 
AAGCTTTAGA 
GAAGCTGTAG 
ATTCATGATG 
TTGCTACAGG 
GAAATCATCG 
AATGTTCCGT 
CATTATTACG 
GCTGCTCCTG 
CGTATTATCA 
ATCGTCCACA 
TTACCAGAAG 
AGTAGAATTA 



CCACAAACTC 
ATTAGTAGTA 
TAGAATTAGT 
CCAGGTGACG 
AGGCGATGCT 
ATACTTACAT 
CCAGTTGAGG 
CCGTGTTGAA 
GTTTACATGA 
AAATTATTAG 
TGGTGTTGCT 
GTTCAATTAC 
AAAGACGAAG 
ATTCTATTTC 
GTACTGAAAT 
ATCGCTCCAA 
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2) INFORMATION FOR SEQ ID NO: 1729 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



50 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: ATCC 15305 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1729 



CGGAGCTATC TTAGTAGTAT CTGCTGCTGA TGGCCCAATG CCACAAACTC 



50 
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GTGAACACAT 


TPTTTTATPA 

X W X X X X ii X V^-t* 


PGTRAPPTTP 

x rvnv^vj x x vj 


GTGYTCCAGP 

VJ X VJ X X V-» V^il VJ V^- 


ATTZ\PTTPT A 
n x X nvj X x Vj 1 n 


i on 

X. \J \J 




TTCTTAAACA 


AAGTTGACAT 

XiTivJ X X VJ fluil X 


PGTTPAPPAY 

vjvj x iun\/Uni 


GAAGAATTAT 

vjnnvjnn x x n x 


TAP A ATTRPT 
1 nunn 1 1 r\vj X 


i so 

X J VJ 




AGAAATGGAA 


GTTCGTGRCT 

VJ X X \<f VJ X VJ X \ V— r X 


TATTAAPPPA 


ATATGACTTP 

n x n x x x v^ 


PPAPPTPAPP 
v^v^nvjvj x onvy vj 


?00 

^ V VJ 




ATGTACCTGT 

n x vj x nv^ >w x vj x 


AATCTCTGGT 

rvri ivi *w x vjvj x 


TPTGPATTAA 


AAGPTTTAGA 

nnvjv/ xxx nvjn 


APPPPAPPPT 
nvjvjVw-Vjnv^vjv*. x 


9 SO 
^ j \j 




PACTATGAGP 


AAAAAATCTT 

flflfMiTUl X V^ X X 


APAPTTAATP 


CAAGCTGTTP 

urvnvjV/ x vj x x vj 


ATPAPTYPAT 
n X vjriU X X v-»n X 


^00 

J V v> 




TPPAAPAPPA 


PAAPPTGATT 
onnv>\j x Oil x x 


PTPAPAAAPP 
v* x vjnVynnnv<v> 


ATTP ATP ATP 
n x x v^n. x vjx*. x vj 


PPAPTTPAPP 

V-rV,fiVj X X VJnVjVj 


*^so 




APPTATTPTP 
nV^ Vj lnl x***i^ 


AATPAPTPPT 

rin x unu x vjvj x 


PPTPPTAPTP 
v*» vj x vjvj 1 nvv x vj 


TTPPTAPAPP 
x x vjvs x n^nvjo 


PPPTPTTPAA 
Vrf>V^o lull unn 


4 00 




PPTPPTPAAA 
V—vj X vjvj l v^nnn 


TPAAAPTPPP 


TPAAPAAATP 
x VJrtnonrvrt x V*. 


PAR ATP ATPP 

unlvn X 1 V^VJ 


PT ATPPAAPA 


4 SO 

4 O VJ 




APAATPAAPP 
nVJn/i 1 V^nnvjv^ 


A A A APAAPTP 


TT APTPPTf^T 
X X nv^ X vjvj 1 vj 1 


APAAATPTTP 

nvjnnn 1 O 1 1 V> 


PPT A A ATT AT 
v^vj 1 nnn. 1 lnl 


S00 

D U VJ 


i n 
x. u 


TAPAPTAPPP 
x rivjnv^. 1 nk/UL* 


TPA APPTPPT 


PAP A AP ATTP 
Vjnv-.nnv^nX 1 Vj 


PTPPATTATT 
vj 1 vjv^fi i lnl i 


■ nv^Vj x Vjvj 1 vj X X 


S SO 

J JU 




TCACGTGATG 


ATGTACAALCj 


TGGTCAAGTT 


TTAGCTGCTC 


CTGGTACTAT 


600 




CACACCACAT 


ACAAAATTCA 


AAGCGGATGT 


TTACGTTTTA 


TCTAAAGATG 


650 




AAGGTGGTCG 


TCATACGCCA 


TTCTTCACTA 


ACTACCGCCC 


ACAATTCTAT 


700 




TTCCGTACTA 


CTGACGTAAC 


TGGTGTTGTT 


AACTTACCAG 


AAGGTACTGA 


750 


15 


AATGGTTATG 


CCTGGCGATA 


ACGTTGAAAT 


GGATGTTGAA 


TTAATTTCTC 


800 




CAATCGCTAT 


TGAAGAC 
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20 2) INFORMATION FOR SEQ ID NO: 1730 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



30 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus simulans 

(B) STRAIN: ATCC 27848 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1730 
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CGGCGGTATC 
GTGAACACAT 
TTCTTAAACA 
TGAAATGGAA 
ATGTACCAGT 
GAATACGAAC 
CCCAACTCCA 
ACGTATTCTC 
CGTGGTCAAA 
AGAAAGCAAG 
TAGACTACGC 
GCACGTGAAG 
TACTCCACAC 
AAGGTGGACG 
TTCCGTACTA 
AATGGTTATG 
CAATCGCGAT 



TTAGTAGTAT 
CTTATTATCA 
AAGCTGACAT 
GTTCGTGACT 
TATCGTTGGT 
AAAAAATCTT 
GAACGTGACT 
AATCACTGGT 
TCAAAGTCGG 
AAAACAACAG 
TGAAGCTGGT 
ACGTACAACG 
ACAAAATTCA 
TCATACTCCA 
CTGACGTAAC 
CCTGGCGATA 
TGAAGAC 



CTGCTGCAGA 
CGTAACGTTG 
GGTTGACGAC 
TATTATCTGA 
TCTGCATTAA 
AGACTTAATG 
CTGATAAACC 
CGTGGTACTG 
TGAAGAAGTT 
TTACAGGTGT 
GACAACATCG 
TGGACAAGTA 
AAGCTGATGT 
TTCTTCACTA 
TGGCGTTGTT 
ACGTAGAAAT 



TGGTCCAATG 
GTGTACCAGC 
GAAGAATTAT 
ATACGACTTC 
AAGCTTTAGA 
CAAGCTGTAG 
ATTCATGATG 
TAGCAACAGG 
GAAATCATCG 
AGAAATGTTC 
GTGCTTTATT 
TTAGCAGCTC 
TTACGTTTTA 
ACTACCGCCC 
CACTTACCAG 
GACTGTTGAA 



CCACAAACTC 
TTTAGTTGTA 
TAGAATTAGT 
CCTGGTGACG 
AGGCGACCCA 
ATGACTACAT 
CCAGTTGAGG 
CCGTGTTGAA 
GTATCACTGA 
CGTAAATTAT 
ACGTGGTGTT 
CTGGCTCTAT 
TCTAAAGAAG 
ACAATTCTAC 
AAGGTACTGA 
TTGATCGCTC 
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2) INFORMATION FOR SEQ ID NO: 1731 



PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 817 bases 
5 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA " 

10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus agalactiae 

(B) STRAIN: ATCC 27591 

(C) ACCESSION NUMBER: 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1731 
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CGGAGCTATC 
GTGAGCACAT 
TTCATGAACA 
TGAAATGGAA 
ACCTTCCAGT 
AAATACGAAG 
TCCAGAACCA 
ATGTATTCTC 
CGTGGTACTG 
AGATATCCAA 
TTGACGAAGG 
CAACGTGATG 
CAACCCACAC 
AAGGTGGACG 
TTCCGTACAA 
AATGGTTATG 
CAATCGCCGT 



CTTGTAGTTG 
CCTTCTTTCA 
AAGTTGACCT 
ATTCGTGACC 
TATCCAAGGT 
ACATCATCAT 
GAACGTGATA 
AATCACTGGA 
TTCGTGTCAA 
AAAGCAGTTG 
TCTTGCAGGG 
AAATCGAACG 
ACTAAATTTA 
TCATACTCCA 
CTGACGTAAC 
CCTGGTGATA 
AGAACAA 



CTTCAACTGA 
CGTCAAGTTG 
TGTTGATGAT 
TTCTTTCAGA 
TCAGCTCTTA 
GGAATTGATG 
CTGACAAACC 
CGTGGTACAG 
CGACGAAGTT 
TTACTGGTGT 
GACAACGTTG 
TGGTCAAGTT 
AAGGTGAAGT 
TTCTTCAACA 
AGGTTCAATC 
ACGTTACTAT 



TGGACCAATG 
GTGTTAAACA 
GAAGAATTGC 
ATACGACTTC 
AAGCACTTGA 
AGCACTGTTG 
TTTACTTCTT 
TTGCTTCAGG 
GAAATCGTTG 
TGAAATGTTC 
GTGTTCTTCT 
CTTGCTAAAC 
TTACATCCTT 
ACTACCGTCC 
GAACTTCCAG 
CGAAGTTGAA 



CCACAAACTC 
CCTTATCGTA 
TTGAATTGGT 
CCAGGTGATG 
AGGCGACGAA 
ATGAGTACAT 
CCAGTTGAAG 
ACGTATCGAC 
GTATTAAAGA 
CGTAAACAAC 
TCGTGGTGTT 
CAGGTTCAAT 
TCTAAAGAAG 
ACAATTCTAC 
CAGGAACAGA 
TTGATTCACC 
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2) INFORMATION FOR SEQ ID NO: 1732 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 
50 (B) STRAIN: ATCC 27336 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1732 
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CGGAGCTATC 
GTGAGCACAT 
TTCATGAACA 
TGAAATGGAA 
ATCTTCCAGT 
AAATACGAAG 
CCCAGAACCA 
ACGTATTCTC 
CGTGGTATCG 
AGAAACTCRA 
TTGACGAAGG 
CAACGTGATG 
CAACCCACAC 
AAGGTGGACG 
TTCCGTACTA 
AATGGTAATG 
CAATCGCCGT 



CTTGTAGTAG 
CCTTCTTTCA 
AAGTTGACTT 
ATCCGTGACC 
TATCCAAGGT 
ACATCGTTAT 
GAACGTGACA 
AATCACTGGA 
TTAAAGTCAA 
AAAGCAGTTG 
TCTTGCTGGA 
AAATCGAACG 
ACTAAATTCA 
TCACACTCCA 
CTGACGTTAC 
CCTGGTGATA 
AGAACAA 



CTTCAACTGA 
CGTCAGGTTG 
GGTTGACGAC 
TATTGTCAGA 
TCAGCACTTA 
GGAATTGATG 
CTGACAAACC 
CGTGGTACAG 
CGACGAAATC 
TTACTGGTGT 
GATAACGTAG 
TGGACAAGTT 
AAGGTGAAGT 
TTCTTCAACA 
AGGTTCAATC 
ACGTGACAAT 



CGGACCAATG 
GTGTTAAACA 
GAAGAATTGC 
ATACGACTTC 
AAGCTCTTGA 
AACACAGTTG 
ATTGCTTCTT 
TTGCTTCAGG 
GAAATCGTTG 
TGAAATGTTC 
GTGTCCTTCT 
ATCGCTAAAC 
CTACATCCTT 
ACTACCGTCC 
GAACTTCCAG 
CGACGTTGAG 



CCACAAACTC 
CCTTATCGTC 
TTGAATTGGT 
CCAGGTGACG 
AGGTGACTCT 
ATGAGTATAT 
CCAGTCGAGG 
ACGTATCGAC 
GTATCAAAGA 
CGTAAACAAC 
TCGTGGTGTT 
CAGGTTCAAT 
ACTAAAGAAG 
ACAATTCTAC 
CAGGTACTGA 
TTGATTCACC 



2) INFORMATION FOR SEQ ID NO: 1733 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus sal 

(B) STRAIN: ATCC 7073 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 



CGGTGCGATC 
GTGAGCACAT 
TTCATGAACA 
TGAAATGGAA 
ACATTCCAGT 
AAATACGAAG 
CCCAGAACCA 
ACGTATTCTC 
CGTGGTGTTG 
AGACATCCAA 
TTGACGRAGG 
CAACGTGATG 
CAACCCACAC 
AAGGTGGACG 
TTCCGTACAA 
AATGGTTATG 
CAATCGCCGT 



CTTGTAGTAG 
CCTTCTTTCA 
AAGTTGACTT 
ATCCGTGACC 
TATCCAAGGT 
ACATCATCAT 
GAACGTGACA 
AATCACTGGT 
TTCGTGTCAA 
AAAGCAGTTG 
TATTGCCGGA 
AAATCGAACG 
ACTAAATTCA 
TCACACTCCA 
CTGACGTAAC 
CCTGGTGATA 
TGAACAA 



CATCTACTGA 
CGTCAGGTTG 
GGTTGACGAT 
TTCTTTCAGA 
TCAGCTCTTA 
GGACTTGATG 
CTGACAAACC 
CGTGGTACTG 
TGACGAAGTT 
TTACTGGTGT 
GATAACGTCG 
TGGTCAAGTA 
AAGGTGAAGT 
TTCTTCAACA 
AGGTTCAATC 
ACGTGACTAT 



902 



ivarius 



1733 

CGGACCAATG 
GTGTTAAACA 
GAAGAATTGC 
ATACGATTTC 
AAGCTCTTGA 
AACACTGTTG 
ATTGTTGCTT 
TTGCTTCAGG 
GAAATCGTTG 
TGAAATGTTC 
GTGTTCTTCT 
TTGGCTGCAC 
TTACATCCTT 
ACTACCGTCC 
GAACTTCCTG 
CGACGTTGAG 



CCACAAACTC 
CCTTATCGTC 
TTGAATTGGT 
CCAGGTGATG 
AGGTGATTCT 
ACGAATACAT 
CCAGTCGAAG 
ACGTATCGAC 
GTCTTAAAGA 
CGTAAACAAC 
TCGTGGTATC 
CTGGTTCAAT 
TCTAAAGAAG 
ACAGTTCTAC 
CAGGTACTGA 
TTGATCCACC 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



2) INFORMATION FOR SEQ ID NO: 1734 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 97 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



10 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Agrobacterium tumefaciens 
(C) ACCESSION NUMBER: x99673 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1734 
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AACATGATCA 
GGCTGCCGAC 
GTCAGGTGGG 
GTTGACGACG 
TCTGTCGTCC 
CGGCACTTGC 
ATCCGCGAGC 
TCCGATCGAC 
CGGGTCGTGG 
GTTGGTGAAG 
TGTTACCGGC 
GCGACAACAT 
CGTGGTCAGA 
CATGGCAGAA 
CGTTCTTCAC 
ACCGGTATCG 
CAACGTCACT 
AGCTGCGCTT 



CCGGTGCTGC 
GGCCCGATGC 
CGTTCCGGCC 
CCGAGCTTCT 
TACGACTTCC 
TGCTCTTGAA 
TGATGGCTGC 
CAGCCGTTCC 
TACGGTTGTG 
AAGTCGAAAT 
GTTGAAATGT 
CGGTGCACTC 
TCCTGTGCAA 
GCCTACATCC 
GAACTACCGT 
TTTCGCTTCC 
GTTGAAGTCG 
CGCTATCCGC 



CGAGATGGAC 
CACAGACCCG 
ATCGTCGTGT 
CGAGCTCGTC 
CGGGCGACGA 
GATTCTGACA 
TGTCGACGCC 
TGATGCCGAT 
ACGGGTCGCG 
CGTCGGCATC 
TCCGCAAGCT 
GTTCGCGGCG 
GCCGGGTTCG 
TGACGAAGGA 
CCGCAGTTCT 
TGAAGGCACG 
AGCTGATCGT 
GAAGGCGGCC 



GGCGCGATCC 
CGAGCACATC 
TCCTCAACAA 
GAGCTTGAAG 
TATCCCGATC 
AGAAGATCGG 
TACATCCCGA 
CGAAGACGTG 
TTGAGCGCGG 
CGTCCGACCT 
GCTCGACCAG 
TTACCCGTGA 
GTCAAGCCGC 
AGAAGGCGGC 
ACTTCCGTAC 
GAAATGGTTA 
TCCGATCGCG 
GTACCGTCGG 



TGGTTTGCTC 
CTGCTTGCCC 
GGTCGACCAG 
TTCGCGAACT 
ATCAAGGGTT 
TGAAGACGCG 
CGCCTGAGCG 
TTCTCGATCT 
TATCGTCAAG 
CGAAGACGAC 
GGCCAGGCCG 
CGGCGTCGAG 
ACAAGAAGTT 
CGTCATACGC 
GACTGACGTT 
TGCCTGGCGA 
ATGGAAGAAA 
CGCCGGC 
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2) INFORMATION FOR SEQ ID NO: 1735 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 885 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



50 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus subtilis 

(B) STRAIN: 168 

(C) ACCESSION NUMBER: Z99104 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1735 
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ATGATCACTG 
TGCTGATGGC 
ACGTTGGTGT 
GACGACGAAG 
TAGCGAATAC 
CTCTTAAAGC 
CTTATGGATG 
AAAACCATTC 
GTACAGTTGC 
GAAGTTGAAA 
AGGTGTTGAA 
ACATTGGTGC 
CAAGTACTTG 
TGAAGTTTAC 
TCTCTAACTA 
ATCATCCATC 
TGAAATGAAC 
GTTTCTCTAT 



GTGCTGCGCA 
CCAATGCCAC 
ACCATACATC 
AGCTTCTTGA 
GACTTCCCTG 
TCTTGAAGGA 
CGGTTGATGA 
ATGATGCCAG 
TACTGGCCGT 
TCATCGGTCT 
ATGTTCCGTA 
CCTTCTTCGC 
CTAAACCAGG 
GTTCTTTCTA 
CCGTCCTCAG 
TTCCAGAAGG 
GTTGAACTTA 
TCGTGAAGGC 



AATGGACGGA 
AAACTCGTGA 
GTTGTATTCT 
ACTAGTTGAA 
GTGATGATGT 
GACGCTGAGT 
GTACATCCCA 
TTGAGGACGT 
GTAGAACGCG 
TCAAGAAGAG 
AGCTTCTTGA 
GGTGTATCTC 
TACAATCACT 
AAGAAGAGGG 
TTCTACTTCC 
CGTAGAAATG 
TTTCTACAAT 
GGACGTACTG 



PPTATPPTTP 


I rio 1 1 L 1 




ftPAPATPPTT 


PTTTPT A A ft A 


i nn 


TAAAPAA ATfZ 




JL jU 


x\ X uunnu J. 


PPP ATPTTPT 




.rlLLnVj Hull 


a & Zl PPT TP TP 


z o u 


Hf^PA ACPTA A 


a atpttpph a 


JUU 


APTPP ZVPA AP 


oUaHLAL 1 VaH 


j DVJ 


aTTPTPZi ATP 


r\L 1 ola 1 Lb i Vj 


/inn 




I Lb-o 1 uHL 




rift K^riri oHflflft 


LAAL loll /iL 


OUU 


1 l.ttHjL 1 LjAA 


LjL 1 CjCj 1 IjAU A 


CCA 

OOU 


GTGAAGAAAT 


CCAACGTGGT 


600 


CCACACAGCA 


AATTCAAAGC 


650 


TGGACGTCAT 


ACTCCATTCT 


700 


GTACAACTGA 


CGTAACTGGT 


750 


GTTATGCCTG 


GAGATAACAC 


800 


CGCTATCGAA 


GAAGGAACTC 


850 


TTGGT 




885 
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2) INFORMATION FOR SEQ ID NO: 1736 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 882 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



35 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacteroides fragilis 

(B) STRAIN: DSM 2151 

(C) ACCESSION NUMBER: P33165 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1736 



40 ATGGTTACTG 
TACTGATGGT 
AGGTAAACGT 
GAAGATGCTG 
TTCATTCTAT 

45 CTCTTGGTGC 
CTGATGGAAG 
TAAACCTTTC 
GTACTGTAGC 
GAAATCGAAA 

50 TGTTGAAATG 
TAGGTCTGTT 
GTTCTTTGTA 
GGTTTATATC 



GTGCTGCTCA 
CCGATGCCTC 
TCCGAAGCTG 
AGATGTTGGA 
GATTTCGACG 
ATTGAACGGC 
CTGTTGATAC 
TTGATGCCGG 
TACAGGTCGT 
TCCTCGGTTT 
TTCCGCAAAC 
GCTTCGTGGT 
AACCGGGTCA 
CTGAAGAAAG 



GATGGACGGT 
AGACTCGTGA 
GTTGTATTCA 
ACTTGTTGAA 
GTGACAATAC 
GTAGAAAAAT 
TTGGATTCCA 
TAGAAGACGT 
ATCGAAACTG 
GGGTGAAGAT 
TTCTGGATCA 
GTTGACAAGA 
GATTAAACCT 
AAGAAGGTGG 



GCTATCATTG 
GCACATCCTT 
TGAACAAGTG 
ATGGAAATGA 
TCCGATCATT 
GGGAAGACAA 
CTGCCTCCGC 
GTTCTCTATC 
GTGTTATCCA 
AAGAAATCAG 
GGGTGAAGCT 
ACGAAATCAA 
CACTCTAAAT 
TCGTCACACT 



TAGTTGCTGC 
TTGGCTCGTC 
CGATATGGTT 
GAGAATTGCT 
CAGGGTTCTG 
AGTAATGGAA 
GCGATGTTGA 
ACAGGTCGTG 
TGTAGGTGAT 
TTGTAACAGG 
GGTGACAACG 
ACGTGGTATG 
TCAAAGCAGA 
CCATTCCATA 
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ACAAATATCG 
ATCACTCTTC 
TATCACTGTA 
TCGCTATCCG 



TCCTCAGTTC 
CGGAAGGAAC 
GAGTTGATCT 
CGAAGGTGGA 



TACCTGCGTA 
TGAAATGGTA 
ATCCGGTTGC 
CGTACAGTAG 



CTATGGACTG 
ATGCCGGGTG 
ACTGAACATC 
GT 



TACAGGTGAA 
ATAACGTAAC 
GGTCTTCGTT 



750 
800 
850 
882 



2) INFORMATION FOR SEQ ID NO: 1737 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

15 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Borrelia burgdorferi 
20 (B) STRAIN: U78183 

(C) ACCESSION NUMBER: U78183 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1737 



2 5 AATATGATTA 

TGCTGATAGT 
AAAGAATGGG 
GCAGATCCTG 
TGAAAAATAT 

3 0 TTGGGGCTAT 

GAACTTCTTG 
TGACAAGCCA 
GAGGCACTGT 
CAAGAAGTTG 

35 TGGTGTTGAA 
ATGTTGGTCT 
CAAGTTTTGT 
TTCAATTTAT 
TCCCAGGGTA 

40 GTTGTTGCTT 
TATTATTGTT 
TTGCTGTTCG 



CAGGAGCAGC 
GGTGCTGAGC 
AATAAAGAAA 
AACTTGTTGA 
GGCTTTTCAG 
GTCAAATCCA 
AATCTATGGA 
TTTTTGCTTG 
TGCTACTGGG 
AAATAGTTGG 
ATGTTCCAGA 
TCTTTTGAGA 
CAGCTCCAGG 
TGTTTGACTA 
TAGACCACAG 
TAGAGGGCAA 
GAGCTGATCT 
AGAAGGTGGA 



TCAAATGGAT 
CTCAAACAAA 
ATAATAGTTT 
GCTTGTTGAA 
CTGATACTCC 
GAAGATCCTG 
TAATTATTTT 
CTGTTGAAGA 
CGTATTGAAA 
AATTAAAGAA 
AAATTCTTGA 
GGCGTTGATA 
TACAATTACT 
AAGAAGAAGG 
TTCTTTTTTA 
AGAAATGGTT 
CTTCAATAGC 
AGAACCGTTG 



GCAGCGATAC 
AGAGCATTTG 
TTTTAAATAA 
GTTGAAGTTT 
AATAATCAAA 
AATCTACAAA 
GATCTTCCAG 
TGTATTTTCT 
GAGGTATTAT 
ACCAGAAAAA 
GCAAGGTCAA 
AAAAAGACAT 
CCACACAAGA 
CGGTAGGCAC 
GAACAACCGA 
ATGCCTGGTG 
TATGGATAAG 
CTTCAGGA 



TTTTAGTTGC 
CTTCTTGCTC 
ATTGGACTTA 
TAGAACTTGT 
GGTTCAGCTT 
ATGCGTTAAA 
AAAGAGATAT 
ATTTCAGGAA 
TAAAGTTGGT 
CTACTGTTAC 
GCAGGGGATA 
TGAGAGGGGG 
AATTTAAAGC 
AAGCCATTTT 
TGTTACTGGA 
ATAATGTTGA 
AATGTAGAAT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
888 



45 

2) INFORMATION FOR SEQ ID NO: 1738 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 894 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



905 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(ii) MOLECULE TYPE: Genomic DNA 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brevibacterium linens 

(B) STRAIN: DSM 20425 

(C) ACCESSION NUMBER: X76863 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1738 



10 AACATGATCA 
CGCTACCGAC 
GTCAGGTCGG 
GTCGATGACG 
GCTCTCGAGC 

15 CCGCTCTCAA 
GATCTCATGG 
CGACAAGCCG 
GTGGAACCGT 
GACGAAATCG 

20 CACCGCTATC 
AGAACGTCGG 
GGTCAGGTCA 
GGCTCAGGTC 
TCTACTCGAA 

25 GGTGTCATCA 
CACCGATATG 
TCCGCTTCGC 



CCGGTGCCGC 
GGACCGATGC 
CGTTCCCTAC 
AGGAGCTCCT 
CAGGACTTCG 
GGCGCTGGAA 
CTGCCGTCGA 
TTCCTCATGC 
CGTCACCGGT 
AAATCGTCGG 
GAGATGTTCC 
TCTGCTCCTC 
TCGTGAAGCC 
TACATCCTGA 
CTACCGTCCG 
CGCTGCCCGA 
TCGGTCGAGC 
AATCCGCGAA 



TCAGATGGAC 
CCCAGACCCG 
ATCGTCGTGG 
CGAGCTCGTC 
ACGGAGACAA 
GGCGACGAGA 
TGACAACGTT 
CCGTCGAGGA 
CGTGTCGAGC 
CATCAAGGAG 
GCAAGACCCT 
CGCGGCACCA 
GGGTTCGATC 
GCAAGGACGA 
CAGTTCTACT 
GGGCACCGAG 
TCATCCAGCC 
GGTGGCCGCA 



GGTGCGATCC 
TGAGCACGTG 
CTCTGAACAA 
GAATTCGAGG 
CGCTCCGGTC 
AGTGGGTCAA 
CCGGAGCCGG 
CGTCTTCACG 
GCGGCGTGCT 
AAGTCGTCCA 
GCCGGATGCC 
AGCGCGAGGA 
ACCCCGCACA 
GGGCGGACGT 
TCCGGACCAC 
ATGGTCATGC 
GATCGCTATG 
CCGTCGGCGC 



TCGTCGTCGC 
CTGCTCGCGC 
GTCCGACATG 
TCCGCGACCT 
ATTCCGGTGT 
GAGCGTTCAG 
AGCGCGATGT 
ATCACCGGTC 
CCTGCCTAAC 
AGACGACTGT 
CGTGCAGGTG 
TGTTGAGCGC 
CCAAGTTCGA 
CACAACCCGT 
GGACGTCACC 
CCGGCGACAA 
GAGGACCGCC 
CGGT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
894 



30 



2) INFORMATION FOR SEQ ID NO: 1739 



35 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



45 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Chlamydia trachomatis 

(B) STRAIN: F/IC-Cal-13 

(C) ACCESSION NUMBER: L22216 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1739 



ATGATCACGG 
AGCAGACGGC 
50 AGGTTGGTGT 
GACGACGCTG 
GTCGAAGTAC 
CCAAGCTGGC 



GCGCAGCGCA 
CCGATGCCGC 
TCCGTACATC 
AACTGCTCGA 
GACTTCCCGG 
GCTGGAAGGC 



GATGGACGGC 
AAACGCGTGA 
ATCGTGTTCC 
GCTGGTCGAG 
GCGACGACAC 
GACACGGGCG 

906 



GCGATCCTGG 
GCACATCCTG 
TGAACAAGTG 
ATGGAAGTTC 
GCCGATCGTG 
AGCTGGGCGA 



TTTGCTCGGC 
CTGGCGCGTC 
CGACAGTGTG 
GCGAACTCCT 
AAGGGTTCGG 
AGTGGCGATC 



50 
100 
150 
200 
250 
300 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



ATGAGCCTGG CAGACGCGCT GGACACGTAC ATCCCGACGC CGGAGCGTGC 

AGTTGACGGC GCGTTCCTGA TGCCGGTGGA AGACGTGTTC TCGATCTCGG 

GCCGTGGTAC GGTGGTGACG GGTCGTGTCG AGCGCGGCAT CGTGAAGGTC 

GGCGAAGAAA TCGAAATCGT CGGTATCAAG CCGACGGTGA AGACGACCTG 

CACGGGCGTT GAAATGTTCC GCAAGCTGCT GGACCAAGGT CAGGCAGGCG 

ACAACGTCGG TATCCTGCTG CGCGGCACGA AGCGTGAAGA CGTGGAGCGT 

GGCCAGGTTC TGGCGAAGCC GGGTTCGATC ACGCCGCACA CGCACTTCAC 

GGCTGAAGTG TACGTGCTGA GCAAGGACGA AGGCGGCCGT CACACGCCGT 

TCTTCAACAA CTACCGTCCG CAGTTCTACT TCCGTACGAC GGACGTGACG 

GGCTCGATCG AGCTGCCGAA GGACAAGGAA ATGGTGATGC CGGGCGACAA 

CGTGTCGATC ACGGTGAAGC TGATTGCTCC GATCGCGATG GAAGAAGGTC 

TGCGCTTCGC AATCCGTGAA GGCGGCCGTA CGGTCGGC 



350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
888 



10 



15 



20 



25 



30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1740 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fibrobacter succinogenes 

(B) STRAIN: S85 

(C) ACCESSION NUMBER: X76866 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1740 

AACATGGTGA CTGGTGCTGC TCAGATGGAC GGCGCTATCC TCGTTGTTGC 50 

CGCTACTGAC GGTCCGATGC CGCAGACTCG CGAACACATC CTTCTCGCTC 100 

ACCAGGTTGG CGTGCCGAAG ATCGTCGTGT TCATGAACAA GTGCGACATG 150 

GTTGACGATG CTGAAATTCT CGACCTCGTC GAAATGGAAG TTCGCGAACT 200 

CCTCTCCAAG TATGACTTCG ACGGTGACAA CACCCCGATC ATCCGTGGTT 250 

CCGCTCTCAA GGCCCTCGAA GGCGATCCGG AATACCAGGA CAAGGTCATG 300 

GAACTCATGA ACGCTTGCGA CGAATACATC CCGCTCCCGC AGCGCGATAC 350 

CGACAAGCCG TTCCTCATGC CGATCGAAGA CGTGTTCACG ATTACTGGCC 400 

GCGGCACTGT CGCTACTGGC CGTATCGAAC GCGGTGTCGT TCGCTTGAAC 4 50 

GACAAGGTTG AACGTATCGG TCTCGGTGAA ACCACCGAAT ACGTCATCAC 500 

CGGTGTTGAA ATGTTCCGTA AGCTCCTCGA CGACGCTCAG GCAGGTGACA 550 

ACGTTGGTCT CCTCCTCCGT GGTGCTGAAA AGAAGGACAT CGTCCGTGGC 600 

ATGGTTCTCG CAGCTCCGAA GTCTGTCACT CCGCACACCG AATTTAAGGC 650 

TGAAATCTAC GTTCTCACGA AGGACGAAGG TGGCCGTCAC ACGCCGTTCA 700 

TGAATGGCTA CCGTCCGCAG TTCTACTTCC GCACCACCGA CGTTACTGGT 7 50 

ACGATCCAGC TCCCGGAAGG TGTCGAAATG GTTACTCCGG GTGACACGGT 800 

CACGATCCAC GTGAACCTCA TCGCTCCGAT CGCTATGGAA AAGCAGCTCC 850 

GCTTCGCTAT CCGTGAAGGT GGACGTACTG TTGGTGCTGG C 8 91 



907 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

2) INFORMATION FOR SEQ ID NO: 1741 



PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 894 bases 
5 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Flavobacterium ferrugineum 

(B) STRAIN: DSM 13524 

(C) ACCESSION NUMBER: X76867 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1741 





AACATGATCA 


CCGGTGCTGC 


CCAGATGGAC 


GGTGCTATCT 


TAGTTGTGGC 


50 




TGCATCAGAC 


GGTCCTATGC 


CTCAAACAAA 


AGAACACATC 


CTGCTTGCTG 


100 


20 


CCCAGGTAGG 


TGTACCTAAA 


ATGGTTGTGT 


TTCTGAATAA 


AGTTGACCTC 


150 




GTTGACGACG 


AAGAGCTCCT 


GGAGCTGGTT 


GAGATCGAGG 


TTCGCGAAGA 


200 




ACTGACTAAA 


CGCGGTTTCG 


ACGGCGACAA 


CACTCCAATC 


ATCAAAGGTT 


250 




CCGCTACAGG 


CGCCCTCGCT 


GGTGAAGAAA 


AGTGGGTTAA 


AGAAATTGAA 


300 




AACCTGATGG 


ACGCTGTTGA 


CAGCTACATC 


CCACTGCCTC 


CTCGTCCGGT 


350 


25 


TGATCTGCCG 


TTCCTGATGA 


GCGTAGAGGA 


CGTATTCTCT 


ATCACTGGTC 


400 




GTGGTACTGT 


TGCTACCGGT 


CGTATCGAGC 


GTGGCCGTAT 


CAAAGTTGGT 


450 




GAGCCTGTTG 


AGATCGTAGG 


TCTGCAGGAG 


TCTCCCCTGA 


ACTCTACCGT 


500 




TACAGGTGTT 


GAGATGTTCC 


GCAAACTCCT 


CGACGAAGGT 


GAAGCTGGTG 


550 




ATAACGCCGG 


TCTCCTCCTC 


CGTGGTGTTG 


AAAAAACACA 


GATCCGTCGC 


600 


30 


GGTATGGTAA 


TCGTTAAACC 


CGGTTCCATC 


ACTCCGCACA 


CGGACTTCAA 


650 




AGGCGAAGTT 


TACGTACTGA 


GCAAAGACGA 


AGGTGGCCGT 


CACACTCCAT 


700 




TCTTCAACAA 


ATACCGTCCT 


CAATTCTACT 


TCCGTACAAC 


TGACGTTACA 


750 




GGTGAAGTAG 


AACTGAACGC 


AGGAACAGAA 


ATGGTTATGC 


CTGGTGATAA 


800 




CACCAACCTG 


ACCGTTAAAC 


TGATCCAACC 


GATCGCTATG 


GAAAAAGGTC 


850 


35 


TGAAATTCGC 


GATCCGCGAA 


GGTGGCCGTA 


CCGTAGGTGC 


AGGA 


894 



2) INFORMATION FOR SEQ ID NO: 1742 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 906 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Helicobacter pylori 

(B) STRAIN: 26695 

(C) ACCESSION NUMBER: AE000626 



908 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1742 



AACATGATCA 
TGCAGCTGAT 
5 GTCAAGTAGG 
GTAGATGACC 
GTTGAGCGCG 
CAGCTTTAAG 
GGTGAAAAAG 

1 0 TCCAGAAAGA 
TCTCTATTGC 
GTGGTGAAAG 
AAAAACGACT 
GTGAAGCCGG 

15 GAAGTGGAAC 
CAAGAAATTT 
GACACACTCC 
ACTGATGTGA 
GCCTGGCGAT 

2 0 TAGAGTTGGG 
GCTGGT 



CCGGTGCGGC 
GGCCCTATGC 
CGTGCCTCAC 
AAGAATTGTT 
TATGAATTTC 
AGCTTTAGAA 
TGCTTAAACT 
GACACTGAAA 
GGGTAGAGGG 
TAGGCGATGA 
GTAACCGGTG 
CGATAATGTG 
GCGGTATGGT 
GAGGGAGAAA 
ATTCTTCACC 
CTGGCTCTAT 
AATGTGAAAA 
AACTAAATTT 



GCAAATGGAC 
CTCAAACTAG 
ATCGTTGTTT 
AGAACTTGTA 
CTGGCGATGA 
GAAGCAAAGG 
TATGGCTGAA 
AAACTTTCTT 
ACTGTGGTTA 
AGTGGAAATC 
TAGAAATGTT 
GGCGTGCTTT 
TCTATGCAAA 
TTTATGTCCT 
AATTACCGCC 
CACCCTTCCT 
TCACTGTAGA 
GCGATTCGTG 



GGAGCGATTT 
GGAGCATATC 
TCTTAAACAA 
GAAATGGAAG 
CACTCCTATC 
CTGGTAATGT 
GTGGATGCCT 
GATGCCGGTT 
CAGGTAGGAT 
GTTGGTATCA 
TAGGAAAGAG 
TGAGAGGAAC 
CCAGGTTCTA 
TTCTAAAGAA 
CGCAATTCTA 
GAAGGCGTAG 
GTTGATTAGC 
AAGGCGGTAG 



TGGTTGTTTC 
TTATTGTCTC 
ACAAGACATG 
TGCGCGAATT 
GTAGCGGGTT 
GGGTGAATGG 
ATATCCCTAC 
GAAGATGTGT 
TGAAAGAGGC 
GACCTACACA 
TTGGAAAAAG 
TAAAAAAGAA 
TCACTCCGCA 
GAAGGCGGGA 
TGTGCGCACA 
AAATGGTTAT 
CCTGTTGCGT 
GACCGTTGGT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
906 



25 2) INFORMATION FOR SEQ ID NO: 1743 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



35 



(vi) ORIGINAL SOURCE : 

(A) ORGANISM: Micrococcus luteus 

(B) STRAIN: IFO 3333 

(C) ACCESSION NUMBER: M17788 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1743 



AACATGATCA 
CGCTACCGAC 
GCCAGGTCGG 

4 5 GTGGAGGACG 
GCTGTCCTCC 
CCGCTCTGAA 
GACCTCATGG 
GGACAAGCCG 

50 GTGGCACCGT 
TCCGAGGTCG 
CGGCATCGAG 
ACTGCGGTCT 



CCGGCGCCGC 
GGCCCGATGG 
CGTGCCGGCC 
AGGAGCTCCT 
AGGAGCTTCG 
GGCCCTCGAG 
ATGCCGTGGA 
TTCCTGATGC 
GGTGACCGGT 
AGATCGTCGG 
ATGTTCCACA 
GCTCGTGCGC 



TCAGATGGAC 
CCCAGACCCG 
CTGCTCGTGG 
CGAGCGTGTC 
ACGTCGACGA 
GGCGACCCCC 
CGAGTACATC 
CGATCGAGGA 
CGCGCCGAGC 
CATCCGCGAC 
AGCAGCTCGA 
GGTCTGAAGC 



GGCGCGATCC 
TGAGCACGTG 
CCCTGAACAA 
GAGATGGAGG 
GGCCCCGGTC 
AGTGGGTCAA 
CCGGACCCGG 
CGTCTTCACG 
GCGGCACCCT 
GTGCAGAAGA 
CGAGGCCTGG 
GCGACGACGT 



TCGTGGTCGC 
CTCCTGGCCC 
GTCGGACATG 
TCCGGCAGCT 
ATCCGCACCT 
GTCCGTCGAG 
TGCGCGACAA 
ATCACCGGCC 
GAAGATCAAC 
CCACTGTCAC 
GCCGGCGAGA 
CGAGCGCGGC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 



909 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

CAGGTGCTGG TGGAGCCGGG CTCCATCACC 

GAACGTCTAC ATCCTGTCCA AGGACGAGGG 

ACTCGAACTA CCGCGCGCAG TTCTACTTCC 

GTCATCACGC TGCCCGAGGG CACCGAGATG 

5 CGAGATGTCG GTCGAGCTCA * TCCAGCCGAT 

GCTTCGCCAT CCGCGAGGGT GGCCGCACCG 



PCT/CAOO/01150 

CCGCACACCA ACTTCGAGGC 650 

TGGGCGTCAC ACCCCGTTCT 700 

GCACCACCGA CGTCACCGGC 750 

GTCATGCCCG GCGACACCAC 800 

CGCCATGGAG GAGGGCCTCG 850 

TGGGCTCCGG C 891 



10 2) INFORMATION FOR SEQ ID NO: 1744 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: Erdmann 

(C) ACCESSION NUMBER: X63539 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1744 



30 



35 



40 



AACATGATCA 
CGCCACCGAC 
GTCAAGTGGG 
GTGGACGACG 
GCTGGCTGCC 
CGCTCAAGGC 
CTGATGAACG 
CAAGCCGTTC 
GAACCGTGGT 
GAAGTTGAGA 
CGGTGTGGAG 
ACGTTGGTTT 
CAGGTTGTCA 
CCAGGTCTAC 
TCAACAACTA 
GTGGTGACAC 
CAACATCTCG 
GTTTCGCGAT 



CCGGCGCCGC 
GGCCCGATGC 
TGTGCCCTAC 
AGGAGCTGCT 
CAGGAATTCG 
GCTCGAGGGT 
CGGTCGACGA 
CTGATGCCGG 
CACCGGACGT 
TCGTCGGCAT 
ATGTTCCGCA 
GCTGCTGCGG 
CCAAGCCCGG 
ATCCTGTCCA 
CCGTCCGCAG 
TGCCGGAGGG 
GTGAAGTTGA 
CCGCGAGGGT 



GCAGATGGAC 
CCCAGACCCG 
ATCCTGGTAG 
CGAACTCGTC 
ACGAGGACGC 
GACGCGAAGT 
GTCGATTCCG 
TCGAGGACGT 
GTGGAGCGCG 
TCGCCCATCG 
AGCTGCTCGA 
GGCGTCAAGC 
CACCACCACG 
AGGACGAGGG 
TTCTACTTCC 
CACCGAGATG 
TCCAGCCCGT 
GGCCGCACCG 



GGTGCGATCC 
CGAGCACGTT 
CGCTGAACAA 
GAGATGGAGG 
CCCGGTTGTG 
GGGTTGCCTC 
GACCCGGTCC 
CTTCACCATT 
GCGTGATCAA 
ACCACCAAGA 
CCAGGGCCAG 
GCGAGGACGT 
CCGCACACCG 
CGGCCGGCAC 
GCACCACCGA 
GTGATGCCCG 
CGCCATGGAC 
TGGGCGCCGG 



TGGTGGTCGC 
CTGCTGGCGC 
GGCCGACGCA 
TCCGCGAGCT 
CGGGTCTCGG 
TGTCGAGGAA 
GCGAGACCGA 
ACCGGCCGCG 
CGTGAACGAG 
CCACCGTCAC 
GCGGGCGACA 
CGAGCGTGGC 
AGTTCGAAGG 
ACGCCGTTCT 
CGTGACCGGT 
GTGACAACAC 
GAAGGTCTGC 
C 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
891 



45 



2) INFORMATION FOR SEQ ID NO: 1745 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

910 

SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(D) TOPOLOGY: Linear 



PCT/CA00/01150 



(ii) MOLECULE TYPE: Genomic DNA 



10 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycoplasma genltalium 

(B) STRAIN: G37 

(C) ACCESSION NUMBER: U39732 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1745 



15 



20 



25 



AATATGATCA 
AGCAACTGAT 
GCCAAGTAGG 
GCTAGTGATG 
GTTAACTTCC 
CAGCTTTAAA 
GATTTGATTA 
AGATAAACCT 
GAGGTACAGT 
CAAGAAGTTG 
TGGAATTGAA 
ATGCTGGGGT 
CAAGTTTTAG 
TGAGATCTAT 
TAAACGGTTA 
TCTATTGCTT 
TTCTATTACT 
AGTTCTCAAT 



CAGGTGCTGC 
AGTGTGATGC 
GGTTCCTAAA 
AAGAGGTACA 
TATGGTTTTG 
AGCATTGGAA 
AAGCAGTTGA 
TTCTTATTAG 
TGTTACAGGA 
AAATTGTTGG 
ATGTTCAAAA 
ATTATTACGT 
CAAAACCAGG 
GCTTTAAAGA 
CCGTCCTCAA 
TAGCTGAAAA 
GTTGAGTTAA 
TCGTGAAGGT 



ACAAATGGAT 
CCCAAACCCG 
ATGGTAGTTT 
AGAACTTGTT 
ATGGTAAGAA 
GGTGATCCAA 
TGAATGGATT 
CAATTGAAGA 
AGAGTTGAAA 
TTTAAAACCA 
AGGAACTTGA 
GGTGTTGAAC 
CTCTATTAAA 
AAGAAGAAGG 
TTCTATTTCC 
TACTGAAATG 
TTGCTCCTAT 
GGTAGAACTG 



GGAGCTATTC 
CGAGCACATC 
TTCTAAACAA 
GCTGAAGAAG 
CACTCCTATT 
AGTGGGAGGC 
CCAACTCCTA 
TACGATGACC 
GAGGTGAACT 
ATTAGAAAAG 
TTCAGCAATG 
GTAAAGAAGT 
CCGCACAAGA 
TGGTAGACAC 
GTACCACTGA 
GTTCTACCTG 
CGCTTGTGAA 
TAGGGGCAGG 



TAGTTGTTTC 
TTACTTGCCC 
GTGTGATATT 
TACGTGATCT 
ATTTATGGCT 
TAAGATCCAT 
CACGTGAAGT' 
ATTACTGGTA 
CAAAGTAGGT 
CAGTTGTTAC 
GCTGGTGACA 
TGAAAGAGGT 
AATTTAAAGC 
ACTGGTTTTT 
TGTAACTGGT 
GTGATAATGC 
AAAGGTAGTA 
C 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
891 



30 



2) INFORMATION FOR SEQ ID NO: 1746 



35 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 91 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



45 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria gonorrhoeae 

(B) STRAIN: MS11 

(C) ACCESSION NUMBER: L36380 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1746 



50 AACATGATTA 
TGCTGCCGAC 
GTCAAGTAGG 
GTCGACGATG 



CCGGCGCCGC 
GGCCCTATGC 
CGTACCTTAC 
CCGAGCTGTT 



ACAAATGGAC 
CGCAAACCCG 
ATCATCGTGT 
CCAACTGGTT 



GGTGCAATCC 
CGAACACATC 
TCATGAACAA 
GAAATGGAAA 



TGGTATGTTC 
CTGCTGGCCC 
ATGCGACATG 
TCCGCGACCT 



50 
100 
150 
200 



911 
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10 



15 



GCTGTCCAGC 
CCGCACTGAA 
GAACTGGCTA 
GGACAAACCA 
GCGGTACCGT 
GACGAGATTG 
CGGCGTTGAA 
ACGTAGGCGT 
CAGGTATTGG 
AGAAGTGTAC 
TCGCCAACTA 
ACGATTACTT 
AACCATTACT 
GCTTTGCGAT 



TACGACTTCC 
AGCCTTGGAA 
CCGCATTGGA 
TTCCTGCTGC 
AGTCACCGGC 
AAATCGTCGG 
ATGTTCCGCA 
ATTGCTGCGC 
CCAAACGGGG 
GTATTGAGCA 
CCGTCCCCAA 
TGGAAAAAGG 
GTAGAACTGA 
TCGCGAAGGC 



CCGGCGACGA 
GGCGATGCCG 
C AG AT AC AT C 
CTATCGAAGA 
CGTGTAGAGC 
TCTGAAAGAA 
AACTGCTGGA 
GGTACCAAAC 
TACTATCACT 
AAGAAGAGGG 
TTCTACTTCC 
TGTGGAAATG 
TTGCGCCTAT 
GGCCGTACCG 



CTGCCCGATC 
CTTACGAAGA 
CCGACTCCCG 
CGTGTTCTCC 
GAGGTATCAT 
ACCCAAAAAA 
CGAAGGTCAG 
GTGAAGACGT 
CCTCACACCA 
CGGCCCCCAT 
GTACCACTGA 
GTAATGCCGG 
CGCTATGGAA 
TGGGTGCCGG 



GTACAAGGTT 
AAAAATCTTC 
AGCGTGCCGT 
ATTTCCGGCC 
CCACGTTGGT 
CCACCTGTAC 
GCGGGCGACA 
AGAACGCGGT 
AGTTCAAAGC 
ACCCCGTTTT 
CGTAACCGGC 
GTGAGAACGT 
GAAGGTCTGC 
C 



250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
891 



2) INFORMATION FOR SEQ ID NO: 1747 



20 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



30 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rickettsia prowazekii 

(B) STRAIN: Madrid E 

(C) ACCESSION NUMBER: Z54170 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1747 



3 5 AATATGATAA 
TGCTGCTGAT 
AACAGGTAGG 
GTAGATGATC 
ATTATCAAAA 

40 CTGCACTTCA 
GAGTTAATGA 
AGATAAACCT 
GAGGTACCGT 
GAAGAAATTG 

45 AGGTGTAGAA 
ATGTCGGTAT 
CAAGTACTTG 
TGAAGTGTAT 
CTAATGATTA 

50 ACAATAAAAT 
TACTTTTTCA 
AATTCTCTAT 



CTGGTGCCGC 
GGTCCTATGC 
TGTACCTGCT 
CTGACCTATT 
TATGGTTTCC 
AGCTTTAGAA 
ATGCAGTAGA 
TTTTTAATGC 
TGTAACTGGT 
AAATAGTAGG 
ATGTTCAGAA 
ATTACTACGT 
CAAAACCTGG 
GTGCTTAGTA 
TCGCCCACAG 
TGCCTTCTGA 
GTAGAATTAA 
ACGTGAAGGT 



TCAGATGGAT 
CTCAAACTAG 
ATGGTAGTAT 
AGAATTAGTT 
CTGGTAATGA 
GGAAAACCTG 
TACGTATATA 
CAATAGAGGA 
AGAGTGGAGT 
TCTAAAAAAT 
AATTACTTGA 
GGTACAAAAA 
GAGCATAAAA 
AAGAGGAAGG 
TTCTATTTTA 
TAAGCAGATG 
TTAAGCCGAT 
GGTAGAACAG 



GGTGCTATAT 
AGAACATATA 
TTTTGAATAA 
GAGATGGAAG 
AATACCTATT 
AAGGTGAAAA 
CCTCAGCCTA 
TGTATTTTCT 
CAGGCATAAT 
ACGCAAAAAA 
TGAAGGACAA 
GAGAAGAAGT 
CCGCATGATA 
TGGACGTCAT 
GAACAACAGA 
GTTATGCCTG 
TGCTATGCAA 
TAGGAGCCGG 



TAGTAGTTTC 
TTACTGGCAA 
AGTAGATATG 
TAAGAGAATT 
ATTAAAGGTT 
AGCTATTAAT 
TAGAGCTACA 
ATTTCAGGCA 
TAAGGTGGGT 
CGACTTGTAC 
TCTGGAGATA 
AGAAAGAGGA 
AATTTGAAGC 
ACCCCATTTA 
TGTTACCGGC 
GAGATAATGC 
GAAGGGTTAA 
T 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
891 
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10 



15 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. 
serotype Typhimurium 

(B) STRAIN: LT2 trpE91 

(C) ACCESSION NUMBER: X55116 

(xi) SEQUENCE DESCRIPTION:, SEQ ID NO: 1748 



choleraesuis 





AACATGATCA 


CCGGTGCTGC 


TCAGATGGAC 


GGCGCGATCC 


TGGTTGTTGC 


50 


20 


TGCGACTGAC 


GGCCCGATGC 


CGCAGACCCG 


TGAGCACATC 


CTGCTGGGTC 


100 




GTCAGGTAGG 


CGTTCCGTAC 


ATCATCGTGT 


TCCTGAACAA 


ATGCGACATG 


150 




GTTGATGACG 


AAGAGCTGCT 


GGAACTGGTT 


G AG AT GG AAG 


TTCGCGAACT 


200 




GCTGTCTCAG 


TACGACTTCC 


CGGGCGACGA 


CACTCCGATC 


GTTCGTGGTT 


250 




CTGCTCTGAA 


AGCGCTGGAA 


GGCGACGCAG 


AGTGGGAAGC 


GAAAATCATC 


300 


25 


GAACTGGCTG 


GCTTCCTGGA 


TTCTTATATT 


CCGGAACCAG 


AGCGTGCGAT 


350 




TGACAAGCCG 


TTCCTGCTGC 


CGATCGAAGA 


CGTATTCTCC 


ATCTCCGGTC 


400 




GTGGTACCGT 


TGTTACCGGT 


CGTGTAGAGC 


GCGGTATCAT 


CAAAGTGGGC 


450 




GAAGAAGTTG 


AAATCGTTGG 


TATCAAAGAG 


ACTCAGAAGT 


CTACCTGTAC 


500 




TGGCGTTGAA 


ATGTTCCGCA 


AACTGCTGGA 


CGAAGGCCGT 


GCCGGTGAGA 


550 


30 


ACGTAGGTGT 


TCTGCTGCGT 


GGTATCAAAC 


GTGAAGAAAT 


CGAACGTGGT 


600 




CAGGTACTGG 


CTAAGCCGGG 


CACCATCAAG 


CCGCACACCA 


AGTTCGAATC 


650 




TGAAGTGTAC 


ATTCTGTCCA 


AAGATGAAGG 


CGGCCGTCAT 


ACTCCGTTCT 


700 




TCAAAGGCTA 


CCGTCCGCAG 


TTCTACTTCC 


GTACTACTGA 


CGTGACTGGT 


750 




ACCATCGAAC 


TGCCGGAAGG 


CGTAGAGATG 


GTAATGCCGG 


GCGACAACAT 


800 


35 


CAAAATGGTT 


GTTACCCTGA 


TCCACCCGAT 


CGCGATGGAC 


GACGGTCTGC 


850 




GTTTCGCAAT 


CCGTGAAGGC 


GGCCGTACCG 


TTGGCGCGGG 


C 


891 



40 2) INFORMATION FOR SEQ ID NO: 1749 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 881 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shewanella putrefaciens 

(B) STRAIN: DSM 50426 



913 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1749 



PCT/CA00/01150 



ATGATCACTG 
AACAGACGGT 
5 AGGTTGGCGT 
GATGACGAAG 
ATCAGAATAC 
CTCTGAAAGC 
TTAGCAGCGG 

10 TAAGCCGTTC 
GTACAGTAGT 
GAAGTTGAAA 
TGTAGAAATG 
GTGGTATTTT 

15 GTATTAGCGA 
AGTTTACGTA 
AAGGCTACCG 
ATCGAACTGC 
GATGGTAGTG 

20 TCGCAATCCG 



GTGCTGCACA 
CCAATGCCAC 
ACCATTCATC 
AGCTGTTAGA 
GATTTCCCAG 
GCTAGAAGGC 
CGCTGGATTC 
CTACTGCCAA 
AACAGGTCGT 
TCGTTGGTGT 
TTCCGTAAAC 
GTTACGTGGT 
AGCCAGGTTC 
CTGTCAAAAG 
TCCACAGTTC 
CAGAAGGCGT 
ACACTGATTT 
TGAAGGCGGT 



GATGGACGGC 
AGACTCGTGA 
ATCGTATTCA 
GCTAGTTGAG 
GTGATGACTT 
GAGCCAGAGT 
TTACATTCCA 
TCGAAGACGT 
GTTGAGCGTG 
ACGTGCGACA 
TGCTTGACGA 
ACTAAGCGTG 
AATCAACCCA 
AAGAAGGTGG 
TACTTCCGTA 
AGAGATGGTA 
GCCCAATCGC 
CGTACAGTGG 



GCGATTCTGG 
GCACATCCTG 
TGAACAAATG 
ATGGAAGTGC 
ACCGGTAATC 
GGGAAGCAAA 
GAACCACAAC 
ATTCTCAATT 
GTATTGTACG 
ACTAAGACAA 
AGGTCGTGCA 
ATGACGTAGA 
CACACTACTT 
TCGTCACACG 
CAACTGACGT 
ATGCCAGGCG 
GATGGACGAA 
T 



TAGTCGCTTC 
CTTTCTCGTC 
TGACATGGTA 
GTGAACTGTT 
CAAGGTTCAG 
AATCCTTGAA 
GTGACATCGA 
TCAGGCCGTG 
CGTAGGCGAC 
CGTGTACTGG 
GGTGAGAACT 
ACGTGGTCAA 
TTGAATCAGA 
CCATTCTTCA 
AACCGGTACT 
ATAACATCAA 
GGTTTACGCT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
881 



25 



30 



2) INFORMATION FOR SEQ ID NO: 1750 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 97 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



35 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Stigmatella aurantiaca 

(B) STRAIN: DW4 

(C) ACCESSION NUMBER: X82820 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1750 



40 



45 



50 



AACATGATCA 
CGCGGCCGAC 
GGCAGGTGGG 
CTGGACGATC 
GCTCAAGAAG 
GCGCGCTCAA 
ATCCTGAAGC 
TGCGACGGAC 
CAGGCCGAGG 
GTGGGCGAGG 
CATCACGGGG 
GAGACAACAT 
CGTGGGCAGG 



CGGGCGCGGC 
GGCCCGATGC 
CGTGCCCTAC 
CGGAGCTGCG 
TACGAGTTCC 
GGCGCTGGAG 
TGATGGCGGC 
AAGCCGTTCC 
AACGGTGGCG 
AAGTGGAGAT 
GTGGAGATGT 
CGGAGCGCTG 
TGCTGGCGAA 



GCAGATGGAC 
CCCAGACGCG 
ATCGTCGTCT 
CGAGCTGGTG 
CGGGCGACAG 
GGAGACACCA 
GGTGGACGAG 
TGATGCCGGT 
ACGGGCCGAG 
CGTGGGGATC 
TCCGCAAGCT 
CTGCGAGGCC 
CTGGGGGAGC 



GGAGCGATTC 
TGAGCACATC 
TCCTGAACAA 
GAGATGGAGG 
CATCCCCATC 
GCGACATCGG 
TACATCCCGA 
GGAAGACGTG 
TGGAGCGCGG 
CGTCCGACGC 
GCTGGACGAG 
TGAAGCGCGA 
ATCAACCCGC 



TGGTGGTGTC 
CTGCTGGCCA 
GGTGGACATG 
TGCGCGACCT 
ATCCCTGGCA 
CGAGGGAGCG 
CGCCGCAGCG 
TTCTCCATCG 
CAAGATCAAG 
AGAAGACGGT 
GGCATGGCGG 
GGACCTGGAG 
ACACGAAGTT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
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CAAGGCGCAG 
CGTTCTTCAA 
ACCGGAACGG 
CAACATCGCC 
AGCTGCCGTT 



GTGTACGTGC 
GGGATACCGG 
TGAAGCTGCC 
ATCGAGGTGG 
CGCCATCCGT 



TGTCGAAGGA 
CCGCAGTTCT 
GGACAACGTG 
AGCTCATTAC 
GAGGGTGGCC 



AGAGGGAGGG 
ACTTCCGGAC 
GAGATGGTGA 
TCCGGTCGCC 
GCACGGTGGG 



CGGCACACGC 
GACGGACGTG 
TGCCGGGAGA 
AT GG AG AAGG 
CGCCGGC 



700 
750 
800 
850 
897 



10 



15 



2) INFORMATION FOR SEQ ID NO: 1751 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 897 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear ' 

(ii) MOLECULE TYPE: Genomic DNA 



20 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Thiomonas cuprina 

(B) STRAIN: Hoe5 

(C) ACCESSION NUMBER: x76871 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1751 



25 



30 



35 



40 



AACATGATCA 
CGCCGCCGAC 
GTCAGGTGGG 
GTCGACGACG 
GCTGTCCAAG 
CGGCCAAGCT 
ATTCTCAAGC 
GGCCGTCGAC 
CCGGGCGCGG 
GTCGGCGAGG 
CTGCACCGGC 
GCGACAACGT 
CGCGGCCAGG 
CACCGCCGAG 
CCTTCTTCAA 
ACCGGCGCCA 
TAATGTGAGC 
GCCTGCGCTT 



CCGGTGCGGC 
GGCCCCATGC 
CGTGCCCTAC 
CCGAGCTGCT 
TACGACTTCC 
GGCCCTCGAA 
TGGCCGAGGC 
GGCGCGTTCC 
CACGGTGGTC 
AAATCGAGAT 
GTGGAAATGT 
CGGCATCTTG 
TGCTGTGCAA 
GTGTACGTGC 
CAACTACCGC 
TCGAACTGCC 
ATCACCGTCA 
CGCCATCCGC 



CCAGATGGAC 
CCCAAACCCG 
ATCATCGTGT 
CGAACTCGTC 
CCGGTGACGA 
GGCGACAAGG 
CCTGGACACC 
TCATGCCCGT 
ACCGGGCGTG 
TGTCGGCCTC 
TCAGGAAGCT 
CTGCGCGGCA 
ACCCGGCTCG 
TGAGCAAGGA 
CCGCAGTTCT 
CAAGGACAAG 
AGCTCATCGC 
GAAGGCGGCC 



GGCGCCATCC 
CGAGCACATC 
TCCTCAACAA 
GAGATGGAAG 
CACCCCCATC 
GCGAACTGGG 
TACATCCCCA 
GGAAGACGTG 
TGGAGCGCGG 
AAGCCCACCC 
GCTCGACCAG 
CCAAGCGCGA 
ATCAAGCCCC 
CGAGGGCGGC 
ACTTCCGCAC 
GAAATGGTCA 
CCCCATCGCC 
GCACCGTCGG 



TGGTCGTGTC 
CTGCTGGCGC 
GTGCGACATG 
TGCGCGAGCT 
ATCAAGGGCT 
CGAAGGCGCC 
CGCCCGAGCG 
TTCTCCATCT 
CATCATCAAG 
TCAAGACCAC 
GGCCAGGCCG 
GGAAGTCGAG 
ACACCCACTT 
CGCCACACCC 
CACCGACGTC 
TGCCCGGCGA 
ATGGAAGAAG 
CGCCGGC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
897 



45 



2) INFORMATION FOR SEQ ID NO: 1752 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 894 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



915 
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20 



25 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Treponema pallidum 

(B) STRAIN: Nichols 

(C) ACCESSION NUMBER: AE001202 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1752 



AATATGATCA 
TGCGCCTGAC 
GTCAGGTTGG 
GTTGATGATC 
GCTTGCTGGA 
CGTTTAAAGC 
GAGGAACTGC 
CGACGCAAGA 
GGCGTGGTAC 
AATGAAGAGG 
TACTGGCATT 
ATAACGTGGG 
GGTCAGGTGC 
GGCGCAGATC 
TTTTTCAAGG 
GGTACGATTT 
CACCAAGATT 
TGAAGCTTGC 



CGGGTGCTGC 
GGCGTTATGC 
TGTTCCCTCC 
CTGAGTTGCT 
TATGGGTTTT 
TCTGCAGGAT 
TTGCGGCCAT 
CCTTTCTTGC 
CGTTGTCACG 
TCGAGATCGT 
GAGATGTTTA 
GCTGCTTTTG 
TTTCTAAGCC 
TACGTGCTCT 
TTATCGTCCG 
CTCTTCCTGA 
ATAGGTGAGC 
GATTCGTGAA 



GCAGATGGAC 
CACAGACGAA 
ATCATTGTTT 
AGAGCTGGTG 
CGCGTGAGAC 
GGCGCTTCCC 
GGATTCCTAC 
TCTCTATCGA 
GGGCGCATCG 
CGGGATTAAG 
ATAAGTTGCT 
CGCGGGGTGG 
CGGTTCTATT 
CTAAGGAAGA 
CAGTTTTATT 
AGGGGTAGAC 
TCATCCACCC 
GGGGGGCGCA 



GGTGGTATTC 
GGAGCATCTT 
TTTTGAACAA 
GAAGAAGAGG 
GCCTATCGTC 
CGGAGGATGC 
TTTGAAGACC 
GGATGTGTAC 
AATGTGGGGT 
CCCACTAAGA 
TGATCAGGGA 
ATAAAAAAGA 
AAGCCACACA 
GGGTGGCCGT 
TTAGAACTAC 
ATGGTGAAGC 
GATAGCTATG 
CTATTGCTTC 



TCGTCGTGTC 
CTGCTCGCCC 
GGTTGATTTG 
TGCGTGATGC 
AAGGGGTCTG 
AGCTTGTATT 
CAGTGCGTGA 
ACTATTTCTG 
AATTAGTCTG 
AAACAGTGGT 
ATTGCAGGTG 
GGTTGAGCGC 
CCAAGTTTGA 
CACAGTCCTT 
TGACATTACC 
CGGGGGATAA 
GACAAGGGTC 
TGGT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
894 



30 



2) INFORMATION FOR SEQ ID NO: 1753 



35 



40 



45 



50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 891 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ureaplasma urealyticum 

(B) STRAIN: ATCC 33697 

(C) ACCESSION NUMBER: Z34275 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1753 



AATATGATTA 
TGCATCTGAT 
GTCAAGTTGG 
ATGACAGATC 
ATTATCTAAA 



CAGGGGCAGC 
GGGGTTATGG 
TGTTCCAAAA 
CAGATATGCA 
TATGGATTTG 



ACAAATGGAT 
CTCAAACTAA 
ATCGTTGTTT 
AGATCTTGTT 
ATGGCGATAA 



GGAGCAATTT 
AGAACATATT 
TCTTAAACAA 
GAAATGGAAG 
CACACCAGTT 



TAGTTATTGC 
TTATTAGCAC 
ATGTGATTTC 
TTCGTGAATT 
ATTCGTGGTT 



50 
100 
150 
200 
250 



916 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



10 



CAGGTCTTAA 
GAATTAATGG 
TGACAAACCA 
GTGGTACAGT 
GATGAGGTTG 
AGGAATTGAA 
ATGCTGGTAT 
CAAGTACTTG 
TAAAGTTTAT 
TTTCAGGATA 
GCTATTTCAT 
TGAAATGACT 
AATTCTCAAT 



GGCTTTAGAA 
ACGCAGTTGA 
TTCTTATTAG 
AGTAACTGGA 
AAATTGTTGG 
ATGTTTAGAA 
TTTATTACGT 
TAAAACCAGG 
ATTCTTAAAA 
CCGTCCACAA 
TACCTGCTGG 
GTAGAATTAA 
CCGTGAAGGT 



GGAGATCCAG 
TTCATGAATT 
CAATTGAAGA 
CGTGTTGAAC 
TCTAAAAGAC 
AATCATTAGA 
GGTATTAAAA 
ATCAATTAAA 
AAGAAGAAGG 
TTCTATTTTA 
TGTTGATTTG 
TTGCTCCAGT 
GGTAAAACTG 



TTTGAGAAGC 
CCATTACCAG 
TGTATTCACA 
GTGGTGTATT 
ACTCAAAAAA 
TCAAGCTGAA 
AAGAAGATGT 
CCTCACCGTA 
TGGACGTCAT 
GAACAACAGA 
GTTATGCCAG 
TGCGATTGAA 
TAGGTCATGG 



AAAAATTGAT 
AACGTAGTAC 
ATTTCAGGAC 
AAAAGTTAAT 
CTGTTGTTAC 
GCTGGTGATA 
TGAACGTGGT 
CTTTTACTGC 
ACACCTATTG 
TGTAACAGGT 
GTGATGACGT 
GATGGATCTA 
T 



300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
891 



15 



2) INFORMATION FOR SEQ ID NO: 1754 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 909 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



30 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Wolinella succinogenes 

(B) STRAIN: DSM 1740 

(C) ACCESSION NUMBER: X76862 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1754 



AACATGATTA 

35 TGCGGCGGAT 
GACAAGTAGG 
GTTGATGACG 
TCTTAGCAAC 
CCGCTCTTAA 

40 TGGGGCGAGA 
TACGCCTGAG 
TATTCTCCAT 
GGCGTGGTTA 
ACAAAAAACA 

45 AGGGTGAGGC 
GAAGATGTTG 
TCACACTAAC 
G AC G AC AC AC 
ACTACAGACG 

50 TATGCCTGGT 
CCCTCGAAGA 
GGTGCGGGT 



CAGGTGCTGC 
GGCCCCATGC 
CGTTCCTTAC 
CTGAGCTTCT 
TACGACTTCC 
AGCTCTTGAA 
AAGTATTGAA 
CGAGATGTGG 
CGCGGGTCGT 
AAGTCGGTGA 
ACCGTAACTG 
GGGTGACAAC 
AGAGAGGTAT 
TTTGAAGGTG 
TCCATTCTTC 
TTACCGGTTC 
GACAACGTTA 
GGGAACACGA 



TCAAATGGAT 
CCCAAACTAG 
ATCGTGGTTT 
TGAGCTTGTT 
CTGGAGATGA 
GAGGCTAACG 
GCTTATGGCT 
ATAAGCCTTT 
GGAACCGTTG 
CG AAGTAGAA 
GCGTTGAGAT 
GTTGGTGTTC 
GGTTCTTTGT 
AAGTTTACGT 
AATGGATACC 
TATCTCTCTT 
AGATCAATGT 
TTCGCGATCC 



GGCGCGATTC 
GGAGCACATT 
TCTTGAACAA 
GAAATGGAAG 
CACTCCTATC 
ACCAGGAAAA 
GAGGTTGACC 
CCTTATGCCT 
TGACAGGAAG 
ATCGTTGGTA 
GTTCCGAAAA 
TTTTGAGAGG 
AAAATAGGTT 
TCTTTCCAAA 
GACCTCAGTT 
CCTGAGGGCG 
TGAGCTTATC 
GTGAAGGTGG 



TTGTTGTTTC 
CTTCTTTCTC 
AGAAGATATG 
TTAGAGAACT 
GTTGCAGGTT 
TGTTGGCGAG 
GATATATTCC 
GTTGAAGACG 
AATTGAAAGA 
TCCGAAACAC 
GAGCTCGACA 
CACCAAGAAA 
CTATCACTCC 
GAGGAAGGCG 
CTATGTTAGA 
TAGAGATGGT 
GCTCCTGTAG 
TCGAACCGTT 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
909 
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2) INFORMATION FOR SEQ ID NO: 1755 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



10 



15 



20 



25 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Burkholderia cepacia 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1755 



ATGATCACGG 
AGCAGACGGC 
AGGTTGGTGT 
GACGACGCTG 
GTCGAAGTAC 
CCAAGCTGGC 
ATGAGCCTGG 
AGTTGACGGC 
GCCGTGGTAC 
GGCGAAGAAA 
CACGGGCGTT 
ACAACGTCGG 
GGCCAGGTTC 
GGCTGAAGTG 
TCTTCAACAA 
GGCTCGATCG 
CGTGTCGATC 
TGCGCTTCGC 



GCGCAGCGCA 
CCGATGCCGC 
TCCGTACATC 
AACTGCTCGA 
GACTTCCCGG 
GCTGGAAGGC 
CAGACGCGCT 
GCGTTCCTGA 
GGTGGTGACG 
TCGAAATCGT 
GAAATGTTCC 
TATCCTGCTG 
TGGCGAAGCC 
TACGTGCTGA 
CTACCGTCCG 
AGCTGCCGAA 
ACGGTGAAGC 
AATCCGTGAA 



GATGGACGGC 
AAACGCGTGA 
ATCGTGTTCC 
GCTGGTCGAG 
GCGACGACAC 
GACACGGGCG 
GGACACGTAC 
TGCCGGTGGA 
GGTCGTGTCG 
CGGTATCAAG 
GCAAGCTGCT 
CGCGGCACGA 
GGGTTCGATC 
GCAAGGACGA 
CAGTTCTACT 
GGACAAGGAA 
TGATTGCTCC 
GGCGGCCGTA 



GCGATCCTGG 
GCACATCCTG 
TGAACAAGTG 
ATGGAAGTTC 
GCCGATCGTG 
AGCTGGGCGA 
ATCCCGACGC 
AGACGTGTTC 
AGCGCGGCAT 
CCGACGGTGA 
GGACCAAGGT 
AGCGTGAAGA 
ACGCCGCACA 
AGGCGGCCGT 
TCCGTACGAC 
ATGGTGATGC 
GATCGCGATG 
CGGTCGGC 



TTTGCTCGGC 
CTGGCGCGTC 
CGACAGTGTG 
GCGAACTCCT 
AAGGGTTCGG 
AGTGGCGATC 
CGGAGCGTGC 
TCGATCTCGG 
CGTGAAGGTC 
AGACGACCTG 
CAGGCAGGCG 
CGTGGAGCGT 
CGCACTTCAC 
CACACGCCGT 
GGACGTGACG 
CGGGCGACAA 
GAAGAAGGTC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
888 



35 



2) INFORMATION FOR SEQ ID NO: 1756 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



50 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9444 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1756 



TGAAAGTTCA GGTAAAACAA CAGTTTCATT ACACGCAATT GCAGAAGTAC 



50 
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AGCGTCAAGG TGGACAAGCA 
CCTGTATATG CACAAAAACT 
ACAACCTGAT ACAGGGGAGC 
GAAGTGGTGC GGTTGATATT 
CCGAAAGCTG AAATTGAAGG 
AGCTCGTCTA ATGTCTCAAG 
AATCAAAAAC AATCGCAATC 



GCGTTCATTG ATGCTGAGCA TGCAATGGAT 
AGGTGTTAAC ATCGATGAAT TACTATTATC 
AAGGTTTAGA AATCGCAGAA GCACTTGTAC 
ATCGTAATTG ACTCTGTAGC AGCTCTTGTA 
AGACATGGGT GACTCACACG TAGGTTTACA 
CACTTCGTAA ACTTTCAGGT GCAATCAATA 
TTTATTAACC AAATTCGT 



100 
150 
200 
250 
300 
350 
388 



2) INFORMATION FOR SEQ ID NO: 1757 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: ATCC 4229 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1757 

TGAAAGTTCA GGTAAAACAA CAGTTTCATT ACACGCAATT GCAGAAGTAC 50 

AGCGTCAAGG TGGACAAGCA GCGTTCATTG ATGCTGAGCA TGCAATGGAT 100 

CCTGTATATG CACAAAAACT AGGTGTTAAC ATCGATGAAT TACTATTATC 150 

ACAACCTGAT ACAGGGGAGC AAGGTTTAGA AATCGCAGAA GCACTTGTAC 200 

GAAGTGGTGC GGTTGATATT ATCGTAATTG ACTCTGTAGC AGCTCTTGTA 250 

CCGAAAGCTG AAATTGAAGG AGACATGGGT GACTCACACG TAGGTTTACA 300 

AGCTCGTCTA ATGTCTCAAG CACTTCGTAA ACTTTCAGGT GCAATCAATA 350 

AATCAAAAAC AATCGCAATC TTTATTAACC AAATTCGT 388 



2) INFORMATION FOR SEQ ID NO: 1758 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 7064 

(C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1758 
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TGAAAGTTCA GGTAAAACAA 
AGCGTCAAGG TGGACAAGCA 
CCTGTATATG CACAAAAACT 
ACAACCTGAT ACAGGGGAGC 
GAAGTGGTGC GGTTGATATT 
CCGAAAGCTG AAATTGAAGG 
AGCTCGTCTA ATGTCTCAAG 
AATCAAAAAC AATCGCAATC 



CAGTTTCATT ACACGCAATT 
GCGTTCATTG ATGCTGAGCA 
AGGTGTTAAC ATCGATGAAT 
AAGGTTTAGA AATCGCAGAA 
ATCGTAATTG ACTCTGTAGC 
AGACATGGGT GACTCACACG 
CACTTCGTAA ACTTTCAGGT 
TTTATTAACC AAATTCGT 



GCAGAAGTAC 50 

TGCAATGGAT 100 

TACTATTATC 150 

GCACTTGTAC 200 

AGCTCTTGTA 250 

TAGGTTTACA 300 

GCAATCAATA 350 

388 



2) INFORMATION FOR SEQ ID NO: 1759 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 13472 

(C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1759 

TGAAAGTTCA GGTAAAACGA CAGTTTCATT ACATGCAATT GCAGAAGTAC 50 

AACGTCAAGG TGGACAAGCA GCATTCATCG ATGCGGAGCA CGCAATGGAT 100 

CCTGTATATG CACAAAAATT AGGCGTTAAC ATAGATGAAT TACTATTATC 150 

ACAGCCTGAT ACAGGGGAGC AAGGATTAGA AATCGCGGAA GCACTTGTAC 200 

GAAGTGGTGC GGTTGACATT ATCGTAATTG ACTCTGTAGC AGCTCTTGTA 250 

CCGAAAGCAG AGATTGAAGG TGACATGGGT GACTCACACG TAGGTTTACA 300 

AGCACGTTTA ATGTCACAAG CACTTCGTAA GCTTTCAGGA GCAATCAACA 350 

AATCAAAAAC AATTGCAATC TTTATTAACC AAATTCGT 388 



2) INFORMATION FOR SEQ ID NO: 1760 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 374 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: ATCC 64 62 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1760 

AAACGACAGT TTCATTACAT GCAATTGCAG AAGTACAACG TCAAGGTGGA 50 

5 CAAGCAGCAT TCATCGATGC GGAGCACGCA ATGGATCCTG TATATGCACA 100 

AAAATTAGGC GTTAACATAG ATGAATTACT ATT AT C AC AG CCTGATACAG 150 

GGGAGCAAGG ATTAGAAATC GCAGAAGCAC TTGTACGAAG TGGTGCGGTT 200 

GACATTATCG TAATTGACTC TGTAGCAGCT CTTGTACCGA AAGCAGAGAT 250 

TGAAGGAGAC ATGGGTGACT CACACGTAGG TTTACAAGCA CGTTTAATGT 300 

10 CACAAGCACT TCGTAAGCTT TCAGGAGCAA TCAACAAATC AAAAACAATT 350 

GCAATCTTTA TTAACCAAAT TCGT 374 



15 2) INFORMATION FOR SEQ ID NO: 1761 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 381 bases 

(B) v TYPE: Nucleic acid 
20 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

2 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus pseudomycoides 

(B) STRAIN: NRRL BD-10 



30 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1761 



GGAAAGTTCA GGTAAAACAA CGGTTTCCTT ACATGCGATT GCAGAAGTGC 50 

AACGTCAAGG TGGACAAGCG GCATTTATTG ATGCGGAGCA TGCGATGGAT 100 

CCTGTATATG CACAAAAGTT AGGTGTTAAT ATTGATGAGT TACTATTATC 150 

GCAGCCTGAT ACAGGAGAAC AAGGTTTAGA AATCGCAGAA GCATTAGTAC 200 

35 GAAGCGGTGC GATTGATATC ATTGTAATTG ACTCTGTAGC AGCTCTTGTA 250 

CCAAAAGCAG AAATCGAAGG GGAAATGGGT GACTCCCACG TTGGTTTACA 300 

AGCGCGTTTA ATGTCACAAG CACTTCGTAA GCTTTCTGGT GCGATTAACA 350 

AATCAAAAAC AATTGCAATC TTCATTAACC A 381 



2) INFORMATION FOR SEQ ID NO: 1762 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 
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(B) STRAIN: HER 1410 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1762 



TGAAAGTTCA GGTAAAACGA 
AACGTCAAGG TGGACAAGCA 
CCTGTATATG CACAAAAATT 
ACAGCCTGAT ACAGGGGAGC 
GAAGTGGTGC GGTTGACATT 
CCGAAAGCAG AGATTGAAGG 
AGCACGTTTA ATGTCACAAG 
AATCAAAAAC AATTGCAATC 



CAGTTTCATT ACATGCAATT GCAGAAGTAC 
GCATTCATCG ATGCGGAGCA CGCAATGGAT 
AGGCGTTAAC ATAGATGAAT TACTATTATC 
AAGGATTAGA AATCGCGGAA GCACTTGTAC 
ATCGTAATTG ACTCTGTAGC AGCTCTTGTA 
CGACATGGGT GACTCACACG TAGGTTTACA 
CACTTCGTAA GCTTTCAGGA GCAATCAACA 
TTTATTAACC AAATTCGT 



50 
100 
150 
200 
250' 
300 
350 
388 



2) INFORMATION FOR SEQ ID NO: 1763 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: HER 1418 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1763 

TGAAAGTTCA GGTAAAACGA CAGTTTCATT ACATGCAATT GCAGAAGTAC 50 

AACGTCAAGG TGGACAAGCA GCATTCATTG ATGCGGAGCA CGCAATGGAT 100 

CCTGTATATG CACAAAAATT AGGCGTTAAC ATAGATGAAT TACTATTATC 150 

ACAGCCTGAT ACAGGGGAGC AAGGATTGGA AATCGCGGAA GCACTTGTAC 200 

GAAGTGGTGC GGTTGACATT ATCGTAATTG ACTCTGTAGC AGCTCTTGTA 250 

CCGAAAGCAG AGATTGAAGG CGATATGGGT GACTCACACG TAGGTTTACA 300 

AGCACGTTTA ATGTCACAAG CACTTCGTAA GCTTTCAGGA GCAATCAACA 350 

AATCAAAAAC AATTGCAATC TTTATTAACC AAATTCGT 388 



2) INFORMATION FOR SEQ ID NO: 1764 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 358 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 33496 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17 64 



CTCCTATCTG GATTATGCGA 
ATGTCCGAGA TGGCCTGAAG 
AACGTATTGG GCAATGACTG 
CGTGGGTGAC GTCATCGGTA 
ACGACACCAT TGTACGTATG 
GTAGATGGCC AGGGTAACTT 
GATGCGTTAT ACGGAAATCC 
CCGACCTC 



TGTCGGTCAT TGTTGGCCGT GCGCTGCCGG 
CCGGTACACC GTCGCGTACT ATACGCCATG 
GAACAAAGCC TATAAAAAAT CTGCCCGTGT 
AATACCACCC TCATGGTGAT ACTGCCGTAT 
GCGCAGCCCT TCTCCCTGCG TTACATGCTG 
TGGTTCGGTC GACGGCGACT CCGCCGCAGC 
GTATGTCGAA GATCGCCCAT GAACTGATGG 



50 
100 
150 
200 
250 
300 
350 
358 



2) INFORMATION FOR SEQ ID NO: 1765 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. ozaenae 

(B) STRAIN: ATCC 11296 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1765 

TTAAGAACTC TTATCTGGAT TATGCGATGT CGGTCATTGT TGGCCGTGCG 50 

CTGCCGGATG TCCGAGATGG CCTGAAGCCG GTACACCGTC GCGTACTTTA 100 

CGCCATGAAC GTATTGGGCA ATGACTGGAA CAAAGCCTAT AAAAAATCAG 150 

CCCGTGTCGT TGGTGACGTA ATCGGTAAAT ACCACCCGCA CGGCGACTCC 200 

GCGGTATACG ACACCATCGT GCGTATGGCG CAGCCGTTCT CGCTGCGTTA 250 

CATGCTGGTG GACGGCCAGG GTAACTTTGG TTCCATCGAC GGCGACTCCG 300 

CCGCGGCGAT GCGTTATACC GAAATTCGTC TGGCGAAAAT CGCTCATGAG 350 

CTGATGGCCG ATCTT 365 



2) INFORMATION FOR SEQ ID NO: 1766 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 344 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella planticola 

(B) STRAIN: ATCC 33531 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1766 



AAGAGCTCGT ATCTGGATTA 
GCCGGATGTC CGAGATGGCC 
CCATGAACGT ATTGGGCAAT 
CGTGTCGTTG GTGACGTAAT 
CGTTTATGAC ACCATTGTAC 
TGCTGGTCGA TGGCCAGGGT 
GCAGCGATGC GTTATACGGA 



TGCGATGTCG GTCATTGTTG 
TGAAACCGGT ACACCGTCGC 
GACTGGAACA AAGCCTATAA 
CGGTAAATAC CACCCTCATG 
GTATGGCACA GCCATTCTCC 
AACTTCGGTT CTGTCGATGG 
AATCCGTATG TCGAAAATCG 



GCCGTGCGCT 
GTACTTTACG 
AAAATCCGCC 
GTGATACCGC 
TTGCGTTATA 
CGACTCCGCC 
CCCA 



50 
100 
150 
200 
250 
300 
344 



2) INFORMATION FOR SEQ ID NO: 1767 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: ATCC 27336 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1767 

AACTCTTATC TGGATTATGC GATGTCGGTC ATTGTTGGCC GTGCGCTGCC 50 

GGATGTCCGA GATGGCCTGA AGCCGGTACA CCGTCGCGTA CTTTACGCCA 100 

TGAACGTATT GGGCAATGAC TGGAACAAAG CCTATAAAAA ATCAGCCCGT 150 

GTCGTTGGTG ACGTAATCGG TAAATACCAC CCGCACGGCG ACTCCGCGGT 200 

ATACGACACC ATCGTGCGTA TGGCGCAGCC GTTCTCGCTG CGTTACATGC 250 

TGGTGGACGG CCAGGGTAAC TTTGGTTCCA TCGACGGCGA CTCCGCCGCG 300 

GCGATGCGTT ATACCGAAAT TCGTCTGGCG AAAATCGCTC ATGAG 345 



2) INFORMATION FOR SEQ ID NO: 1768 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 356 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1768 



2) INFORMATION FOR SEQ ID NO: 1769 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 361 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 29011 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1769 

TAAGAACTCT TATCTGGATT ATGCGATGTC GGTCATTGTT GGCCGTGCGC 50 

TGCCGGATGT CCGAGATGGC CTGAAGCCGG TACACCGTCG CGTACTTTAC 100 

GCCATGAACG TATTGGGCAA TGACTGGAAC AAAGCCTATA AAAAATCAGC 150 

CCGTGTCGTT GGTGACGTAA TCGGTAAATA CCACCCGCAC GGCGACTCCG 200 

CGGTATACGA CACCATCGTG CGTATGGCGC AGCCGTTCTC GCTGCGTTAC 250 

ATGCTGGTGG ACGGCCAGGG TAACTTTGGT TCCATCGACG GCGACTCCGC 300 

CGCGGCGATG CGTTATACCG AAATTCGTCT GGCGAAAATC GCTCATGAGC 350 

TGATGGCCGA T 361 



2) INFORMATION FOR SEQ ID NO: 1770 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 365 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



CTTATCTGGA TTATGCGATG 
GTCCGAGATG GCCTGAAGCC 
CGTATTGGGC AATGACTGGA 
TTGGTGACGT AATCGGTAAA 
GACACCATCG TGCGTATGGC 
GGACGGCCAG GGTAACTTTG 
TGCGTTATAC CGAAATTCGT 
GATCTT 



TCGGTCATTG TTGGCCGTGC 
GGTACACCGT CGCGTACTTT 
ACAAAGCCTA TAAAAAATCA 
TACCACCCGC ACGGCGACTC 
GCAGCCGTTC TCGCTGCGTT 
GTTCCATCGA CGGCGACTCC 
CTGGCGAAAA TCGCTCATGA 



GCTGCCGGAT 
ACGCCATGAA 
GCCCGTGTCG 
CGCGGTATAC 
ACATGCTGGT 
GCCGCGGCGA 
GCTGATGGCC 



50 
100 
150 
200 
250 
300 
350 
356 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. 
rhinoscleroma tis 

(B) STRAIN: ATCC 13824 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1770 

TAAGAACTCT TATCTGGATT ATGCGATGTC GGTCATTGTT GGCCGTGCGC 50 

TGCCGGATGT CCGAGATGGC CTGAAGCCGG TACACCGTCG CGTACTTTAC 100 

10 GCCATGAACG TATTGGGCAA TGACTGGAAC AAAGCCTATA AAAAATCAGC 150 

CCGTGTCGTT GGTGACGTAA TCGGTAAATA CCACCCGCAC GGCGACTCCG 200 

CGGTATACGA CACCATCGTG CGTATGGCGC AGCCGTTCTC GCTGCGTTAC 250 

ATGCTGGTGG ACGGCCAGGG TAACTTTGGT TCCATCGACG GCGATTCCGC 300 

CGCGGCGATG CGTTATACCG AAATTCGTCT GGCGAAAATC GCTCATGAGC 350 

15 TGATGGCCGA TCTTG 365 



2) INFORMATION FOR SEQ ID NO: 1771 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 357 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
25 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
30 (A) ORGANISM: Klebsiella terrigena 

(B) STRAIN: ATCC 33257 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1771 

35 TCGTATCTGG ATTATGCGAT GTCGGTCATT GTTGGCCGTG CGCTGCCGGA 50 

TGTCCGAGAT GGGTTGAAAC CGGTACACCG TCGCGTACTT TACGCCATGA 100 

ACGTATTGGG CAATGACTGG AACAAAGCCT ATAAAAAATC CGCCCGTGTC 150 

GTTGGTGACG TAATCGGTAA ATATCACCCT CACGGTGATA CCGCCGTTTA 200 

TGACACCATT GTACGTATGG CGCAGCCATT CTCCTTGCGT TATATGCTGG 250 

4 0 TCGATGGCCA GGGTAACTTC GGTTCTGTCG ATGGCGACTC CGCCGCAGCG 300 

ATGCGTTATA CGGAAATCCG TATGTCGAAA ATCGCCCACG AGCTGATGGC 350 

CGACCTC 357 



45 

2) INFORMATION FOR SEQ ID NO: 1772 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 968 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Legionella pneumophila subsp. pneumophila 
5 (B) STRAIN: ATCC 33152 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1772 

CAGTACACAG GCGAGTTCTT TTTGCGATGA GTGAGTTAAG TAATGATTGG 50 

10 AATAAGCCGT ATAAAAAATC TGCTCGTGTA GTAGGGGATG TCATTGGTAA 100 

ATATCATCCT CATGGGGATA CAGCTGTTTA TGACACTATT GTTCGTATGG 150 

CTCAGCCCTT TTCCATGCGT TATATGCTGA TTGATGGGCA GGGTAATTTT 200 

GGCTCTGTAG ATGGAGATGC TCCAGCTGCC ATGCGTTACA CTGAAGTAAG 250 

AATGTCCAAA GTGGCGCATG CTTTACTGGC TGATTTGGAT AAGGAAACCG 300 

15 TTGATTTTAG TCCTAACTAT GATGAAACAG AATTTGCTCC AGTGGTATTG 350 

CCATCGAGAA TTCCCAATTT ACTAGTTAAT GGCTCTTCCG GTATTGCGGT 4 00 

AGGGATGGCT ACTAATATTC CACCACATAA TCTTACCGAA GTAATCAATG 4 50 

CATGTATTGC TTTAGTGGAT GAACCTGACA CGAGTCTTGA AGATTTAATG 500 

GAAATTATTC CTGGCCCTGA TTTTCCTACA GCCGCAATTA TTAATGGTCG 550 

20 TGCTGGAATT ATTGAAGGTT ATCGTACTGG AAAAGGGCGG GTTGTTATCA 600 

GGGCACGCAC AGAAATTGAA ACGGATGAAA GTTCAGGCCG TCAGTCAATT 650 

ATTATTCAGG AATTACCCTA TCAGGTGAAT AAAGCGCGTT TGATCGAGCG 700 

TATTGCTGAA TTGGTAAGGG ACAAGAAAAT CGAAGGAATT TCCGGCTTGA 750 

GAGATGAGTC AGACAAGCAA GGAATGAGAG TAGTCATTGA ATTAAAACGC 800 

25 AATGAAGTAG CAGATGTGGT ATTGAATAAC CTGTTCGCTC ATACTCAAAT 8 50 

GCAAAATGTA TTCGGAATTA ATATGGTTGC TCTGGTGGAT GGCCAACCGC 900 

GTACTTTGAA TTTGAAGCAA ATACTGGAAT ATTTTATAAA ACATCGAAGA 950 

GAGGTTGTTA CCAGACGC 968 



30 



45 



2) INFORMATION FOR SEQ ID NO: 1773 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 967 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus mirabilis 

(B) STRAIN: ATCC 25933 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1773 



CACCGCCGAG TGCTATTTGC GATGAATGTA CTGGGAAACG ATTGGAATAA ' 50 

ACCTTATAAA AAATCAGCCC GTGTTGTTGG GGATGTAATC GGTAAATATC 100 

50 ACCCGCACGG TGACAGTGCT GTCTATGAAA CGATTGTTCG TTTAGCACAG 150 

CCTTTTTCTA TGCGCTACAT GTTGGTTGAC GGTCAGGGTA ACTTCGGGTC 200 

AGTTGATGGT GACTCCGCGG CGGCTATGCG TTATACCGAA GTTCGTATGG 250 

CGAAGATCGC CCATGAACTG CTGGCGGATT TGGAAAAAGA GACGGTCGAC 300 
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TTTGTTCCTA 
CCGTATTCCA 
TGGCAACGAA 
CTTGCCTATG 
TATTACCGGG 
GAATATTAGA 
CAGGCTGATA 
GACAGAAATT 
CGGAGCTTGT 
GAGTCTGATA 
AGTCGGTGAA 
TCTCTTTTGG 
TTGAACCTAA 
GGTGACTCGC 



ACTATGATGG 
AACTTGTTAG 
TATCCCTCCG 
TTGATAATGA 
CCTGATTTTC 
TGCTTATCGT 
TTGAAACTGA 
CCTTATCAGG 
AAAAGATAAA 
AAGACGGTAT 
GTAGTATTAA 
TATTAATATG 
AAGAAATTAT 
CGTACCA 



AACAGAAAAT ATACCGGCTG TTATGCCAAC 
TTAATGGCTC TTCAGGTATT GCCGTTGGGA 
CATAACCTCG- GTGAAGTTAT CGACGGTTGT 
AGACATCACC ATAGAAGAAT TAATGGAATA 
CGACTGCTGC GATTATTAAT GGTCGCAGAG 
ACAGGGCGTG GAAAGATTTA TATCCGTGCT 
TGAGAAAACA GGTCGCGAAA CCATTATCGT 
TGAATAAAGC CCGTTTAATT GAAAAAATTG 
CGTATTGAAG GTATCAGCGG ATTACGTGAC 
GCGTATTGTT GTTGAGATCA AACGTGATGC 
ATCACCTCTT TTCACAAACT CAAATGCAAG 
GTTGCGCTTC ATCAAGGCCA ACCAAAATTA 
CGCAGCCTTT ATTCGCCATC GTCGTGAAGT 



350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 
950 
967 



10 



15 



20 



25 



30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1774 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 978 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia rettgeri 

(B) STRAIN: ATCC 9250 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1774 

CCAGTACACC GCAGAGTATT GTATGCGATG AATGTATTGG GAAATGATTG 50 

GAATAAACCC TATAAAAAAT CTGCCCGTAT TGTTGGGGAC GTCATCGGTA 100 

AATACCATCC ACATGGTGAT AGCGCTGTTT ACGAGACAAT CGTTCGTCTT 150 

GCTCAGCCTT TCTCAATGCG TTACATGCTG GTTGATGGTC AGGGAAACTT 200 

CGGTTCTGTT GACGGAGACT CCGCAGCGGC AATGCGTTAT ACGGAAATCC 250 

GTATGGCGAA AATTGCCCAT GAACTACTTG CTGACCTTGA AAAAGAAACC 300 

GTTGATTTCG TTCCTAACTA TGACGGAACA GAGCACATTC CTGAAGTTAT 350 

GCCAACGAAA ATCCCAAACC TTTTGGTTAA TGGGTCGTCA GGTATTGCTG 400 

TTGGGATGGC AACCAATATC CCACCTCACA ATTTAGGGGA GGTGATTAAT 450 

GGTTGTCTTG CCTATATAGA AGACGAAGAC ATCAGCATTG ATGGTTTAAT 500 

GGAACACATT CCAGGGCCTG ATTTCCCAAC CGCAGCTATT ATTAATGGCC 550 

GTCGTGGGAT TATTGATGCG TATCGCACAG GGCGTGGCAA GGTCTATATC 600 

CGTGCAAGCG CTGAAGTGGA AGTCGATGAG AAAAATGGTC GCGAAACCAT 650 

TATTGTCAGC GAAATTCCTT ATCAAGTGAA TAAAGCTCGC TTGATTGAAA 700 

AAATTGCTGA GTTAGTTAAA GACAAGCGTG TTGAAGGTAT CAGTGCACTG 750 

CGTGACGAGT CTGATAAAGA CGGTATGCGT ATTGTTATTG AAATCAAACG 800 

CGATGCGGTG GGTGAAGTTG TACTGAACAA CTTATATTCC CTGACCCAAT 850 

TGCAAGTTTC TTTTGGTATC AATATGGTAG CTCTACACCA AGGGCAGCCG 900 

AAAATACTGA ATTTAAAAGA TATCATTGCT GCTTTTGTGC GTCACCGCCG 950 

TGAAGTCGTC ACTCGTCGTA CGATTTTC 978 
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2) INFORMATION FOR SEQ ID NO: 1775 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 978 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Proteus vulgaris 

(B) STRAIN: ATCC 13315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1775 

CCAGTACACC GTCGCGTTCT TTTCGCGATG AATGTATTAG GAAACGATTG 50 

GAATAAACCT TATAAAAAAT CAGCCCGTGT TGTTGGGGAT GTTATCGGTA 100 

AATATCACCC GCACGGTGAC AGTGCTGTTT ATGAAACGAT AGTTCGTTTA 150 

GCACAGCCTT TTTCTATGCG TTACATGTTG GTTGACGGGC AGGGTAACTT 200 

CGGGTCAGTT GATGGTGACT CGGCGGCTGC AATGCGTTAT ACCGAAGTTC 250 

GTATGGCGAA AATCGCCCAT GAACTGCTGG CGGATTTGGA AAAAGAAACG 300 

GTTGATTTTG TTCCTAACTA TGATGGAACA GAGCATATCC CGGCAGTCAT 350 

GCCAACCCGT ATTCCAAACT TATTAGTCAA TGGTTCTTCA GGTATCGCAG 400 

TCGGGATGGC AACAAACATT CCTCCGCATA ACCTAGGTGA AGTTATCGAC 4 50 

GGCTGTCTTG CTTATGTTGA TAACGAAGAC ATCACTATTG AAGAGTTGAT 500 

GGAGCATATC ACGGGGCCTG ATTTCCCAAC TGCCGCTATT ATTAATGGCC 550 

GCAGAGGAAT TTTAGATGCT TACCGTACTG GGCGCGGAAA AATTTATATT 600 

CGTGCACAAG CTGATGTAGA AACCGATGAG AAAACTGGTC GCGAAACAAT 650 

TATCGTGACG GAAATTCCTT ATCAGGTGAA CAAAGCTCGC TTAATTGAAA 7 00 

AAATTGCAGA GCTTGTTAAA GATAAACGTA TTGAAGGCAT TAGCGGATTA 750 

CGTGATGAGT CAGATAAAGA TGGTATGCGC ATTGTTGTTG AAATTAAGCG 800 

TGATGCTGTT GGTGAAGTTG TACTAAATCA CTTATTTTCT CAGACTCAGA 850 

TGCAGGTTTC TTTTGGTATT AACATGGTTG CACTGCATCA AGGTCAACCG 900 

AAAGTGTTAA ACCTGAAAGA AATTATTTCA GCCTTTATTC GTCACCGTCG 950 

TGAAGTGGTG ACTCGTCGTA CTATTTTT 978 



2) INFORMATION FOR SEQ ID NO: 1776 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 940 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia enterocolitica 

(B) STRAIN: ATCC 9610 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1776 

TGGGTAATGA CTGGAATAAG CCATACAAAA AATCGGCCCG TGTAGTCGGG 50 

GACGTTATCG GTAAATATCA CCCGCATGGT GACAGCGCGG TCTACGACAC 100 

5 AATTGTGCGT ATGGCCCAGC CGTTCTCACT GCGCTATATG CTGGTGGATG 150 

GGCAGGGCAA CTTCGGTTCC GTTGATGGCG ACTCCGCCGC AGCGATGCGT 200 

TATACCGAAA TCCGTATGTC TAAAATTGCT CACGAATTGT TGGCGGACTT 250 

AGAAAAAGAT ACCGTCGACT TCGTGCCGAA CTATGACGGT ACGGAGCAAA 300 

TTCCTGCCGT AATGCCAACC CGAATCCCTA ACTTGCTGGT TAACGGCTCG 350 

10 TCAGGTATTG CTGTCGGTAT GGCAACCAAT ATTCCGCCGC ATAACCTTTC 400 

TGAGGTTATT GATGGCTGTC TGGCCTATAT CGAAGATGAA AACATCACCA 450 

TTGAAGGGTT GATGGAGTAC ATCCCGGGGC CAGATTTCCC AACTGCTGCG 500 

ATTATCAATG GTCGCCGTGG TATTGAAGAA GCTTATCGTA CTGGCCGTGG 550 

CAAGGTGTAT ATCCGTGCCC GTGCTGAAGT TGAGGCTGAC GCTTAAACCG 600 

15 GTCGCGAAAC CATTATTGTT CACGAGATCC CGTATCAGGT GAACAAGGCG 650 

CGGTTGATTG AAAAAATCGC CGAGCTGGTT AAAGAAAAAC GCGTAGAAGG 700 

CATCAGTGCG TTGCGTGATG AGTCTGATAA AGACGGCATG CGTATCGTGA 750 

TTGAAATCAA ACGTGATGCT GTCGGGGAAG TGGTTCTGAA CAACCTCTAT 800 

TCTCTGACGC AATTGCAGGT GACTTTCGGT ATCAATATGG TGGCTCTGTC 850 

20 TCAAGGGCAG CCTAAGTTGC TTAACCTGAA AGACATTTTG GTTGCTTTCG 900 

TGCGCCACCG CCGTGAAGTG GTGACTCGCC GTACCATTTT 940 



25 2) INFORMATION FOR SEQ ID NO: 1777 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 13182 



40. 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1777 



CCGTTTATTG GCGATGGCCT GAAGCCGGTC CAGCGTCGCA TCGTCTATGC 50 

GATGTCTGAA CTGGGTCTGA ACGCCAGCGC GAAGTTCAAA AAGTCCGCCC 100 

GCACCGTCGG TGACGTGCTG GGTAAATACC ATCCCCACGG CGACAGCGCG 150 

TGCTATGAAG CCATGGTGCT GATGGCTCAG CCCTTCTCCT ACCGCTATCC 200 

45 GCTGGTTGAC GGTCAGGGAA ACTGGGGGGC GCCGGACGAT CCTAAATCCT 250 

TCGCCGCAAT GCGTTATACC GAATCCCGTT TGTCGAAGTA TGCTGAACTG 300 

CTGCTGAGCG AACTGGGGCA AGGCACCGTT GACTGGGTAC CAAACTTCGA 350 

CGGCACTTTG CAGGAGCCGA AGATGCTGCC TGCGCGCCTG CCCAATATTC 400 

TGCTAAACGG TACTACCGGC ATTGCCGTTG GGATGGCGAC GGACATTCCG 450 

50 CCGCACAACC TGCGTGAAGT GGCCCGGGCG GCCATTACCC TGATTGAAAA 500 

GCCGCAAACT TCGCTGGATG ACCTGCTGGA TATCGTGCAG GGGCCGGATT 550 

ATCCTACCGA AGCGGAAATC ATTACCCCCC GTGCCGAAAT CCGCAAAATC 600 

TACCAGAATG GCCGCGGTTC GGTGCGGATG CGCGCGGTAT GGGCCAAAGA 650 
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5 2) INFORMATION FOR SEQ ID NO: 1778 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 714 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 33496 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1778 



GTTTATTGGC GATGGCCTGA AGCCGGTCCA GCGTCGCATC GTCTATGCGA 50 

TGTCTGAACT GGGTCTGAAC GCCAGCGCGA AGTTCAAAAA GTCCGCCCGC 100 

ACCGTCGGTG ACGTGCTGGG TAAATACCAT CCCCACGGCG ACAGCGCGTG 150 

CTATGAAGCC ATGGTGCTGA TGGCTCAGCC CTTCTCCTAC CGCTATCCGC 200 

25 TGGTTGACGG TCAGGGAAAC TGGGGGGCGC CGGACGATCC TAAATCCTTC 2 50 

GCCGCAATGC GTTATACCGA ATCCCGTTTG TCGAAGTATG CTGAACTGCT 300 

GCTGAGCGAA CTGGGGCAAG GCACCGTTGA CTGGGTACCA AACTTCGACG 350 

GCACTTTGCA GGAGCCGAAG ATGCTGCCTG CGCGCCTGCC CAATATTCTG 4 00 

CTAAACGGTA CTACCGGCAT TGCCGTTGGG ATGGCGACGG ACATTCCGCC 4 50 

30 GCACAACCTG CGTGAAGTGG CCCGGGCGGC CATTACCCTG ATTGAAAAGC 500 

CGCAAACTTC GCTGGATGAC CTGCTGGATA TCGTGCAGGG GCCGGATTAT 550 

CCTACCGAAG CGGAAATCAT TACCCCCCGT GCCGAAATCC GCAAAATCTA 600 

CCAGAATGGC CGCGGTTCGG TGCGGATGCG CGCGGTATGG GCCAAAGAAG 650 

ACGGCGCGGT GGTGATTAGC GCGCTGCCGC ATCAGGTTTC CGGCGCCCGC 7 00 

35 GTGCTTGAGC AGAT 714 



2) INFORMATION FOR SEQ ID NO: 1779 



40 



(i) SEQUENCE. CHARACTERISTICS: 

(A) LENGTH: 722 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Klebsiella pneumoniae subsp. ozaenae 

(B) STRAIN: ATCC 11296 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1779 
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TTGGCGATGG CTTAAAACCG GTCCAGCGTC 

GAGCTGGGGC TGAACGCCAG CGCGAAATTC 

CGGCGACGTG TTGGGTAAAT ATCACCCGCA 

5 AAGCGATGGT GCTGATGGCG CAGCCGTTCT 

GATGGTCAGG GAAACTGGGG GGCGCCGGAC 

CATGCGTTAC ACCGAATCCC GCCTGTCGAA 

GCGAGCTGGG GCAGGGGACG GTCGACTGGG 

CTGCAGGAGC CGAAAATGCT GCCAGCGCGT 

10 CGGCACCACC GGCATCGCGG TAGGTATGGC 

ACCTGCGGGA AGTGGCCAAA GCGGCGATTA 

ACCACCCTCG ACGAACTGCT GGATATCGTA 

CGAGGCGGAG ATCATCACCT CGCGGGCGGA 

ACGGGCGCGG CTCAGTGCGC ATGCGCGCGG 

15 GCGGTGGTGA TCAGCGCGCT GCCGCATCAG 

GGAGCAGATT GCGGCGCAGA TG 



GCATCGTCTA TGCGATGTCC 50 

AAAAAGTCCG CCCGCACCGT 100 

CGGCGACAGC GCCTGCTATG 150 

CTTACCGCTA TCCGCTGGTG 200 

GATCCCAAAT CTTTCGCCGC 250 

GTATGCCGAG CTGCTGCTCA 300 

TGCCAAACTT TGACGGTACG 350 

TTGCCGAACA TCCTGCTGAA 400 

GACCGATATT CCTCCGCACA 4 50 

CGCTGATTGA GCAGCCGAAA 500 

CAGGGGCCGG ATTTCCCGAC 550 

AATTCGAAAA ATCTACCAGA 600 

TGTGGAGTAA AGAGGACGGC 650 

GTCTCCGGCG CCAAAGTGCT 7 00 

722 



20 2) INFORMATION FOR SEQ ID NO: 1780 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 692 bases 

(B) TYPE: Nucleic acid 
25 (C)- STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella planticola 

(B) STRAIN: ATCC 33531 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1780 



GTCCAGCGTC GCATCGTTTA TGCGATGTCT GAGCTGGGGC TGAACGCCAG 50 

CGCGAAGTTC AAAAAGTCCG CCCGCACCGT GGGTGATGTG CTGGGTAAAT 100 

ATCACCCGCA CGGCGATAGC GCATGCTATG AAGCTATGGT GCTGATGGCG 150 

CAGCCATTCT CCTACCGCTA CCCGCTGGTG GATGGGCAGG GGAACTGGGG 200 

40 GGCACCGGAC GATCCTAAAT CCTTCGCCGC GATGCGTTAT ACCGAATCCC 250 

GTTTGTCGAA GTATGCGGAA CTGCTGCTGG GCGAACTGGG GCAGGGAACC 300 

GTCGACTGGG TGCCGAACTT CGACGGGACG ATGCAGGAGC CGAAAATGCT 350 

GCCTGCGCGT CTGCCGAATA TTCTGCTGAA CGGCACTACC GGCATCGCCG 400 

TCGGTATGGC AACCGATATT CCTCCGCACA ACCTGCGTGA AGTGGCGCAG 4 50 

45 GCGGCGATTA CCCTGATCGA AAAACCGCAG ACCTCGCTCG ACGAACTGCT 500 

GGATATCGTT CACGGACCCG ACTACCCGAC CGAAGCCGAA ATCATTACTC 550 

CACGCGCGGA GATCCGCAAA ATCTACCAGA ACGGCCGCGG TTCGGTGCGG 600 

ATGCGCGCGG TATGGAAAAA AGAGGACGGC GCGGTCGTGA TTACGGCGTT 650 

GCCGCATCAG GTTTCCGGCG CCCGCGTGCT GGAGCAAATT GC 692 

50 



2) INFORMATION FOR SEQ ID NO: 1781 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 700 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

10 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: ATCC 27336 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1781 



GTTTATTGGC GATGGCTTAA AACCGGTCCA GCGTCGCATC GTCTATGCGA 50 

TGTCCGAGCT GGGGCTGAAC GCCAGCGCGA AATTCAAAAA GTCCGCCCGC 100 

ACCGTCGGCG ACGTGTTGGG TAAATATCAC CCGCACGGCG ACAGCGCCTG 150 

CTATGAAGCG ATGGTGCTGA TGGCGCAGCC GTTCTCTTAC CGCTATCCGC 200 

2 0 TGGTGGATGG TCAGGGAAAC TGGGGGGCGC CGGACGATCC CAAATCTTTC 250 

GCCGCCATGC GTTACACCGA ATCCCGCCTG TCGAAGTATG CCGAGCTGCT 300 

GCTCAGCGAG CTGGGGCAGG GGACGGTCGA CTGGGTGCCA AACTTTGACG 350 

GCACGCTGCA GGAGCCGAAA ATGCTGCCAG CGCGTTTGCC GAACATCCTG 400 

CTGAACGGCA CCACCGGCAT CGCGGTAGGC ATGGCGACCG ATATTCCTCC 450 

25 GCACAACCTG CGGGAAGTGG CCAAAGCGGC GATTACGCTG ATTGAGCAGC 500 

CGAAAACCAC CCTCGACGAA CTGCTGGATA TCGTACAGGG GCCGGATTTC 550 

CCGACCGAGG CGGAGATCAT CACCTCGCGG GCGGAAATTC GCAAAATCTA 600 

CCAAAACGGG CGCGGCTCAG TGCGCATGCG CGCGGTGTGG AGTAAAGAGG 650 

ACGGCGCGGT GGTGATCAGC GCGCTGCCGC ATCAGGTCTC CGGCGCCAAA 7 00 

30 



2) INFORMATION FOR SEQ ID NO: 1782 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 726 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 
45 (B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1782 

ATTGGCGATG GCTTAAAACC GGTCCAGCGT CGCATCGTCT ATGCGATGTC 50 

50 CGAGCTGGGG CTGAACGCCA GCGCGAAATT CAAAAAGTCC GCCCGCACCG 100 

TCGGCGACGT GTTGGGTAAA TATCACCCGC ACGGCGACAG CGCCTGCTAT 150 

GAAGCGATGG TGCTGATGGC GCAGCCGTTC TCTTACCGCT ATCCGCTGGT 200 

GGATGGTCAG GGAAACTGGG GGGCGCCGGA CGATCCCAAA TCTTTCGCCG 250 

933 
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CCATGCGTTA CACCGAATCC CGCCTGTCGA 

AGCGAGCTGG GGCAGGGGAC GGTCGACTGG 

GCTGCAGGAG CCGAAAATGC TGCCAGCGCG 

ACGGCACCAC CGGCATCGCG GTAGGCATGG 

5 AACCTGCGGG AAGTGGCCAA AGCGGCGATT 

AACCACCCTC GACGAACTGC TGGATATCGT 

CCGAGGCGGA GATCATCACC TCGCGGGCGG 

AACGGGCGCG GCTCAGTGCG CATGCGCGCG 

CGCGGTGGTG ATCAGTGCGC TGCCGCATCA 

10 TGGAGCAGAT TGCGGCGCAG ATGCGC 



PCT/CA00/01150 

AGTATGCCGA GCTGCTGCTC 300 

GTGCCAAACT TTGACGGCAC 350 

TCTGCCGAAC ATCCTGCTGA 400 

CGACCGATAT TCCTCCGCAC 450 

ACGCTGATTG AGCAGCCGAA 500 

ACAGGGGCCG GATTTCCCGA 550 

AAATTCGCAA AATCTACCAG 600 

GTGTGGAGTA AAGAGGACGG 650 

GGTCTCTGGC GCCAAAGTGC 700 

726 



2) INFORMATION FOR SEQ ID NO: 1783 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 706 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
20 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
25 (A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 29011 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1783 

30 GGCTTAAAAC CGGTCCAGCG TCGCATCGTC TATGCGATGT CCGAGCTGGG 50 

GCTGAACGCC AGCGCGAAAT TCAAAAAGTC CGCCCGCACC GTCGGCGACG 100 

TGTTGGGTAA ATATCACCCG CACGGCGACA GCGCCTGCTA TGAAGCGATG 150 

GTGCTGATGG CGCAGCCGTT CTCTTACCGC TATCCGCTGG TGGATGGTCA 200 

GGGAAACTGG GGGGCGCCGG ACGATCCCAA ATCTTTTGCC GCCATGCGTT 250 

35 ACACCGAATC CCGCCTGTCG AAGTATGCCG AGCTGCTGCT CAGCGAGCTG 300 

GGGCAGGGGA CGGTCGACTG GGTGCCAAAC TTTGACGGCA CGCTGCAGGA 350 

GCCGAAAATG CTGCCAGCGC GTCTGCCGAA CATCCTGCTG AACGGCACCA 400 

CCGGCATCGC GGTAGGCATG GCGACCGATA TTCCTCCGCA CAACCTGCGG 450 

GAAGTGGCCA AAGCGGCGAT TACGCTGATT GAGCAGCCGA AAACCACCCT 500 

4 0 CGACGAACTG CTGGATATCG TACAGGGGCC GGATTTCCCG ACCGAGGCGG 550 

AGATCATCAC CTCGCGGGCG GAAATTCGCA AAATCTACCA GAACGGGCGC 600 

GGCTCAGTGC GCATGCGCGC GGTGTGGAGT AAAGAGGACG GCGCGGTGGT 650 

GATCAGTGCG CTGCCGCATC AGGTCTCCGG CGCCAAAGTG CTGGAGCAGA 700 

TTGCGG 706 

45 



2) INFORMATION FOR SEQ ID NO: 1784 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 614 bases 

(B) TYPE: Nucleic acici 

(C) STRANDEDNESS: Double 

934 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. 
rhinoscleromatis 

(B) STRAIN: ATCC 13884 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1784 

TTGGCGATGG CTTAAAACCG GTCCAGCGTC GCATCGTCTA TGCGATGTCC 50 

GAGCTGGGGC TGAACGCCAG CGCGAAATTC AAAAAGTCCG CCCGCACCGT 100 

CGGCGACGTG TTGGGTAAAT ATCACCCGCA CGGCGACAGC GCCTGCTATG 150 

AAGCGATGGT GCTGATGGCG CAGCCGTTCT CTTACCGCTA TCCGCTGGTG 200 

GATGGTCAGG GAAACTGGGG GGCGCCGGAC GATCCCAAAT CTTTCGCCGC 250 

CATGCGTTAC ACCGAATCCC GCCTGTCGAA GTATGCCGAG CTGCTGCTCA 300 

GCGAGCTGGG GCAGGGGACG GTCGACTGGG TGCCAAACTT TGACGGCACG 350 

CTGCAGGAGC CGAAAATGCT GCCAGCGCGT CTGCCGAACA TCCTGCTGAA 400 

CGGCACCACC GGCATCGCGG TAGGCATGGC GACCGATATT CCTCCGCACA 450 

ACCTGCGGGA AGTGGCCAAA GCGGCGATTA CGCTGATTGA GCAGCCGAAA 500 

ACCACCCTCG ACGAACTGCT GGATATCGTA CAGGGGCCGG ATTTCCCGAC 550 

CGAGGCGGAG ATCATCACCT CGCGGGCGGA AATTCGCAAA ATCTACCAGA 600 

ACGGGCGCGG. CTCA 614 



2) INFORMATION FOR SEQ ID NO: 1785 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella terrigena 

(B) STRAIN: ATCC 33257 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1785 

GCCTGAAACC GGTCCAGCGG CGCATCGTTT ATGCGATGTC CGAACTGGGT 50 

CTGAACGCCA CCGCTAAATT CAAAAAATCC GCGCGCACCG TCGGCGACGT 100 

GCTGGGTAAA TATCACCCGC ACGGCGATAG CGCCTGCTAT GAGGCGATGG 150 

TGCTGATGGC GCAGCCGTTC TCTTACCGCT ACCCGCTGGT GGACGGTCAG 200 

GGCAACTGGG GCGCCCCGGA CGATCCCAAA TCCTTCGCCG CGATGCGTTA 250 

TACCGAATCC CGCCTGTCAA AGTATGCGGA GCTGCTGCTG GGCGAGCTGG 300 

GTCAGGGAAC CGTTGACTGG GTACCTAACT TTGACGGTAC GATGCAGGAG 350 

CCGAAAATGC TGCCTGCGCG TTTGCCGAAT ATTCTGCTCA ACGGCACCAC 400 

CGGTATCGCC GTGGGGATGG CCACCGATAT TCCGCCGCAC AACCTGCGCG 4 50 

AAGTGGCCAA AGCGGCCATC ACCCTGATTG AAAAGCCGCA GACCTCGCTC 500 
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GACGAACTGC TGGATATCGT TCACGGGCCG GACTACCCCA CCGAAGCTGA 
AATCATCACC CCGCGCGCCG AGATCCGCAA AATCTATCAG AACGGTCGCG 
GCTCGGTTCG CATGCGTGCG GTGTGGAAAA AAGAGGACGG CGCGGTGGTG 
ATTAGCGCCC TGCCGCAT 



550 
600 
650 
668 



2) INFORMATION FOR SEQ ID NO: 1786 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 7064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1786 

CATTACGTTC TAACACTCAA GGACGCGGAA CATTCTCTAT GGTGTTTGAC 50 
CACTATGAAG AAGTACCAAA GTCTGTTTCT GAAGAAATTA TCAAAAAAAA 100 
TAAAGGTGAA TAA 113 



2) INFORMATION FOR SEQ ID NO: 1787 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1787 

AACGTCATTA CGTTCTAACA CTCAAGGACG CGGAACATTC TCTATGGTGT 50 
TTGACCACTA TGAAGAAGTA CCAAAGTCTG TTTCTGAAGA AATTATCAAA 100 
AAAAATAAAG GTGAATAA 118 



2) INFORMATION FOR SEQ ID NO: 1788 
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(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 120 bases 
<B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9444 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1788 

GCAACGTCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCTATGGT 50 
GTTTGACCAC TATGAAGAAG TACCAAAGTC TGTTTCTGAA GAAATTATCA 100 
AAAAAAATAA AGGTGAATAA 120 



2) INFORMATION FOR SEQ ID NO: 1789 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 13472 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1789 

AACGTCATTA CGTTCTAACA CTCAAGGACG CGGAACATTC TCTATGGTGT 50 
TTGACCACTA TGAAGAAGTA CCAAAGTCTG TTTCTGAAGA AATTATCAAA 100 
AAAAATAAAG GTGAATAA 118 



2) INFORMATION FOR SEQ ID NO: 1790 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: ATCC 4229 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1790 



GCAACGTCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCTATGGT 
GTTTGACCAC TATGAAGAAG TACCAAAGTC TGTTTCTGAA GAAATTATCA 
AAAAAAATAA AGGTGAATAA 



50 
100 
120 



2) INFORMATION FOR SEQ ID NO: 1791 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus pseudomycoides 

(B) STRAIN: NRRL B-617 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1791 

GCAACGTCAT TACGTTCTAA CACGCAAGGA CGCGGAACAT TCTCAATGAC 50 
ATTTGATCAT TATGAAGAAG TACCGAAGTC TGTTTCAGAA GAAATTATCA 100 
AAAAAAATAA AGGTGAATAA 120 



2) INFORMATION FOR SEQ ID NO: 1792 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 116 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 49064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1792 

CGTCATTACG TTCTAACACT CAAGGACGCG GAACATTCTC TATGGTGTTT 50 
GACCACTATG AAGAAGTACC AAAGTCTGTT TCTGAAGAAA TTATCAAAAA 100 
AAATAAAGGT GAATAA 116 
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2) INFORMATION FOR SEQ ID NO: 1793 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9440 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1793 

GCAACGTCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCTATGGT 50 

GTTTGACCAC TATGAAGAAG TACCAAAGTC TGTTTCTGAA GAAATTATCA 100 

AAAAAAATAA AGGTGAATAA 120 

20 



10 



2) INFORMATION FOR SEQ ID NO: 1794 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 100 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 
35 (B) STRAIN: ATCC 15816 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1794 

CACTCAAGGA CGCGGAACAT TCTCTATGGT GTTTGATCAC TATGAAGAAG 50 
40 TACCAAAGTC TGTTTCTGAA GAAATTATCA AAAAAAATAA AGGTGAATAA 100 



2) INFORMATION FOR SEQ ID NO: 1795 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10204 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1795 

GCAACGGCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCAATGAC 50 
ATTTGATCAT TATGAAGAAG TACCGAAGTC TGTTTCTGAA GAAATTATTA 100 
AAAAAAATAA AGGTGAATAA 120 



2) INFORMATION FOR SEQ ID NO: 1796 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 
25 (B) STRAIN: ATCC 6462 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1796 

GCGACAGCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCAATGAC 50 
30 ATTTGATCAT TATGAAGAAG TACCGAAGTC TGTTTCAGAA GAAATTATTA 100 
AAAAAAATAA AGGCGAATAA 120 



35 2) INFORMATION FOR SEQ ID NO: 1797 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 120 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: ATCC 10792 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1797 

GCAACGTCAT TACGTTCTAA CACTCAAGGA CGCGGAACAT TCTCTATGGT 50 
GTTTGACCAC TATGAAGAAG TACCAAAGTC TGTTTCTGAA GAAATTATCA 100 
AAAAAAATAA AGGTGAATAA 120 
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2) INFORMATION FOR SEQ ID NO: 1798 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS; Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10204 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1798 

TTGATTTTTA TCGATTGTTC AAGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGAGACGCAA GTTTCACTTT CTAGTCTAAA TATAAAATAA CCCATATAAA 100 
CTAAGGAGGA ATTTAGA 117 



2) INFORMATION FOR SEQ ID NO: 1799 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: ATCC 10792 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1799 

TTGATTTTTA TCGATTGTTC AAGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACGTAA GTTTCACTTT CTAGTCTAAA TATAAAATAA CCTATATAAA 100 
CTAAGGAGGA ATTTAGA 117 



2) INFORMATION FOR SEQ ID NO: 1800 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) . TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

941 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: ATCC 4229 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1800 



TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 
TGGGACGCAA GTTTCGCTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 
CTAAGGAGGA ATTTAGA 



50 
100 
117 



2) INFORMATION FOR SEQ ID NO: 1801 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus pseudomycoides 

(B) STRAIN: NRRL B-617 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1801 

TTGATTTTTA TCGATTGTTC AAGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACTTAA GTTTCACTTT CTAGTCTAAA TATAAAATAA CCTATATAAA 100 
CTAAGGAGGA ATTTAGA 117 



2) INFORMATION FOR SEQ ID NO: 1802 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9444 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1802 

TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACGCAA GTTTCGCTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 100 
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5 2) INFORMATION FOR SEQ ID NO: 1803 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 
10 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

15 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 7064 



20 



25 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1803 

TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 

TGAGACGCAA GTTTCGCTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 100 

CTAAGGAGGA ATTTAGA 117 



2) INFORMATION FOR SEQ ID NO: 1804 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM Bacillus cereus 

(B) STRAIN : ATCC 49064 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1804 



TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACGCAA GTTTCACTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 100 
45 CTAAGGAGGA ATTTAGA 1 117 



2) INFORMATION FOR SEQ ID NO: 1805 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

5 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: ATCC 6462 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1805 

TTGATTTTTA TCAATTGTTC GAGTATAACT ACTTATGTAA GCTTAGAAAG 50 

TGGGACGTAA GTTTCGCTTT CTAGTCTAAA TATAAAATAA CCTATATAAA 100 

CTAAGGAGGA ATTTAGA 117 

15 



2) INFORMATION FOR SEQ ID NO: 1806 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 
30 (B) STRAIN: ATCC 14579 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1806 

TTGATTTTTA TCGATTGTTC AAGTATAACT ACTTATGTAA GCTTAGAAAG 50 
35 TGGGACGTAA GTTTCACTTT CTAGTCTAAA TATAAAATAA CCTATATAAA 100 
CTAAGGAGGA ATTTAGA 117 



40 2) INFORMATION FOR SEQ ID NO: 1807 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 15816 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1807 

TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 

TGGGACGCAA GTTTCGCTTT CTAGCCTAAA TATAAAATAA CC TAT AT AAA 100 

5 CTAAGGAGGA ATTTAGA 117 



2) INFORMATION FOR SEQ ID NO: 1808 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 117 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
20 (A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 13472 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1808 

25 TTGATTTTTA TCGATTGTTC AAGTATAACT ACTTATGTAA GCTTAGAAAG 50 
TGGGACGTAA GTTTCACTTT CTAGTCTAAA TATAAAATAA CCTATATAAA 100 
CTAAGGAGGA ATTTAGA 117 



30 



40 



2) INFORMATION FOR SEQ ID NO: 1809 



(i) SEQUENCE CHARACTERISTICS: 
(A)' LENGTH: 117 bases 
35 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9440 

45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1809 

TTGATTTTTA TCGATTCTTC ACGTATAACT ACTTATGTAA GCTTAGAAAG 50 

TGGGACGCAA GTTTCGCTTT CTAGCCTAAA TATAAAATAA CCTATATAAA 100 

CTAAGGAGGA ATTTAG 117 

50 



2) INFORMATION FOR SEQ ID NO: 1810 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus mycoides 

(B) STRAIN: ATCC 6462 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1810 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTTCTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GACGCTGCTC CAGAAGAAAG AGAGCGCGGA ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTATGTTAAA AACATGAT 278 



2) INFORMATION FOR SEQ ID NO: 1811 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus thuringiensis 

(B) STRAIN: ATCC 10792 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1811 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTTCTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTATGTTAAA AACATGAT 278 



2) INFORMATION FOR SEQ ID NO: 1812 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 270 bases 

946 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 15816 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1812 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTACTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTATGTTAAA 270 



2) INFORMATION FOR SEQ ID NO: 1813 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus weihenstephanensis 

(B) STRAIN: WSBC 10204 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1813 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTTCTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GACGCTGCTC CAGAAGAAAG AGAGCGCGGA ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCATGCTGA CTATGTTAAA AACATGAT 278 



2) INFORMATION FOR SEQ ID NO: 1814 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 266 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: CIP 9440 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 1814 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTACTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 2 50 

GTCACGCTGA CTATGT 266 



2) INFORMATION FOR SEQ ID NO: 1815 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 7064 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1815 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTACTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTATGTTAA 269 



2) INFORMATION FOR SEQ ID NO: 1816 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 13472 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1816 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTTCTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

10 GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTATGTTA 2 68 



15 



25 



2) INFORMATION FOR SEQ ID NO: 1817 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 278 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 

(B) STRAIN: ATCC 4229 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1817 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTACTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

3 5 GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTATGTTAAA AACATGAT 278 



40 



50 



2) INFORMATION FOR SEQ ID NO: 1818 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 268 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN: ATCC 14579 . 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1818 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

5 AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTTCTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTATGTTA 268 



2) INFORMATION FOR SEQ ID NO: 1819 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus anthracis 
25 (B) STRAIN: CIP 9444 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1819 

ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

30 AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTACTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 200 

CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTATGTTAAA AACATGAT 278 

35 



2) INFORMATION FOR SEQ ID NO: 1820 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 278 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus pseudomycoides 
50 (B) STRAIN: NRRL B-617 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1820 



950 
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ATGGCTAAAG CTAAATTCGA ACGTTCTAAA 
AATCGGCCAC GTTGACCATG GTAAAACTAC 
CAGTTCTTGC AAAAGCTGGT GGTGCTGAAG 
GATGCTGCTC CAGAAGAAAG AGAGCGCGGT 
5 CGTTGAGTAC GAAACTGAAA CTCGTCACTA 
GTCACGCTGA CTATGTTAAA AACATGAT 



PCT/CA0O/O1150 

CCCCATGTTA ACATCGGTAC 50 
ATTAACTGCT GCGATCACTA 100 
CACGCGGATA CGACCAAATC 150 
ATCACAATCT CAACTGCACA .200 
TGCACACGTT GACTGCCCAG 250 

278 



10 2) INFORMATION FOR SEQ ID NO: 1821 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bacillus cereus 

(B) STRAIN : ATCC 49064 



25 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1821 



ATGGCTAAAG CTAAATTCGA ACGTTCTAAA CCCCATGTTA ACATCGGTAC 50 

AATCGGCCAC GTTGACCATG GTAAAACTAC ATTAACTGCT GCGATCACTA 100 

CAGTACTTGC AAAAGCTGGT GGTGCTGAAG CACGCGGATA CGATCAAATC 150 

GATGCTGCTC CAGAAGAAAG AGAGCGCGGT ATCACAATCT CAACTGCACA 200 

30 CGTTGAGTAC GAAACTGAAA CTCGTCACTA TGCACACGTT GACTGCCCAG 250 

GTCACGCTGA CTA 263 



35 2) INFORMATION FOR SEQ ID NO: 1822 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1668 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

45 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 



50 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1822 

CAATCGAAGT ACAACGTTCT CTTCGWGTAT TGGACGGTGC GGTTACTGTT 50 
CTTGACTCAC AATCAGGTGT TGAGCCTCAA ACTGAAACAG TTTGGCGTCA 100 
AGCAACTGAG TACGGAGTTC CACGTATCGT ATTTGCTAAC AAAATGGACA 150 

951 
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WO 01/23604 PCT/CA00/01 150 

AAATCGGTGC TGACTTCCTT TACTCAGTAA GCACACTTCA CGACCGTCTT 200 

CAAGCAAACG CACACCCAAT CCAATTGCCA ATCGGTGCTG AAGATGACTT 250 

CCGTGGTATC ATCGACTTGA TCAAGATGAA AGCTGAAATC TATACTAACG 300 

ACCTTGGTAC AGATATCCTT GAAGAAGATA TTCCAGCTGA ATACCTTGAC 350 

5 CAAGCTCAAG AATACCGTGA AAAATTGGTT GAAGCAGTCG CTGAAACTGA 400 

TGAAGACTTG ATGATGAAAT ACCTTGAAGG TGAAGAAATC ACTAACGAAG 450 

AATTGAAAGC TGCTATCCGT AAAGCAACTA TCAACGTTGA ATTCTTCCCA 500 

GTATTGTGTG GTTCTGCCTT CAAGAACAAG GGTGTTCAAT TGATGCTTGA 550 

TGCGGTTATC GACTACCTTC CAAGCCCACT TGATATCCCA GCGATCAAAG 600 

10 GTATCAACCC AGATWCAGAT GAAGAAGAAA CTCGTCCAGC ATCTGACGAA 650 

GAGCCATTCG CAGCTCTTGC CTTCAAGATC ATGACGGACC CATTTGTAGG 700 

TCGTTTGACA TTCTTCCGTG TATACTCARG TGTTCTCCAA TCARGKTCTT 750 

ACGTATTGAA CACATCTAAA GGTAAACGTG AACGTATCGG ACGTATCCTT 800 

CAAATGCACG CTAACAGCCG TCAAGAAATT GACACTGTTT ACTCAGGTGA 850 

15 TATCGCTGCT GCCGTTGGTT TGAAAGATAC TYCAACTGGT GACTCATTGM 900 

CAGATGAAAA AGCTAAAATC ATCCTTGAGT CAATCAACGT TCCAGAMCCA 950 

GTTATCCAAT TGATGGTTGA GCCAAAATCT AAAGCTGACC AAGATAAGAT 1000 

GGGTATCGCC CTTCAAAAAT TGGCTGAAGA AGATCCAACA TTCCGCGTTG 1050 

AAMCAAACGT TGAAACTGGT GAAMCAGTTA TCTCAGGTAT GGGTGAGCTT 1100 

2 0 CACTTGGACG TCCTTGTTGA CCGTATGCGT CGTGAGTTCA AAGTGGAAGC 1150 

GAACGTAGGT GCTCCTCAAG TATCTTACCG TGAAACATTC CGCGCTTCTA 1200 

CTCAAGCACG TGGATTCTTC AAACGTCAGT CTGGTGGTAA AGGTCAATTC 1250 

GGTGATGTAT GGATTGAATT TACTCCAAAC GAAGAAGGTA AAGGATTCGA 1300 

ATTCGAAAAC GCAATCGTCG GTGGTGTGGT TCCTCGTGAA TTTATCCCAG 1350 

25 CGGTTGAAAA AGGTTTGGTA GAATCTATGG CTAACGGTGT TCTTGCAGGT 1400 

TACCCAATGG TTGACGTTAA AGCTAAGCTT TACGATGGTT CATACCACGA 14 50 

TGTCGACTCA TCTGAAACTG CCTTCAAGAT CGCGGCTTCA CTTGCCCTTA 1500 

AAGAAGCTGC TAAGTCAGCA CAACCAGCTA TCCTTGAGCC AATGATGCTT 1550 

GTAACCATCA CTGTTCCAGA AGAAAACCTT GGTGATGTTA TGGGTCACGT 1600 

30 AACTGCTCGT CGTGGACGTG TAGATGGTAT GGAAGCACWC GGTAACAGCC 1650 

AAATCGTTCG TGCTTACG 1668 



35 2) INFORMATION FOR SEQ ID NO: 1823 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 bases 

(B) TYPE: Nucleic acid 
40 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

4 5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Budvicia aquatica 

(B) STRAIN: ATCC 35567 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1823 

AGACCTGCGT TCACAAACAC AGGGTCGTGC TTCTTACTCT ATGGAGTTCT 50 
TGAAGTACAA CGAAGCGCCA AACAACGTTG CTACAGCAAT CATTGAAGCT 100 
CGTAAGGCTA GATAA 115 
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2) INFORMATION FOR SEQ ID NO: 1824 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 107 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Buttiauxella agrestis 

(B) STRAIN: ATCC 33320 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1824 

CTGCGTTCAC TGACCAAGGT CGTGCATCTT ACTCCATGGA ATTCCTGAAG 50 
TATGATGACG CGCCAAACAA CGTAGCTCAG GCCGTAATCG AAGCTCGCGG 100 
TAAATAA 107 



2) INFORMATION FOR SEQ ID NO: 1825 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 79 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 13182 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1825 

TTACTCCATG GAGTTCCTGA AGTATGATGA TGCGCCGAAC AACGTTGCTC 50 
AGGCCGTAAT CGAAGCCCGT GGTAAATAA 79 



2) INFORMATION FOR SEQ ID NO: 1826 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 111 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plesiomonas shigelloides 

(B) STRAIN: ATCC 14029 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1826 



CAGCTGCGTT CTCTGACCAA AGGTCGTGCA TCATACACTA TGGAATTCCT 
GAAGTATGAT GATGCGCCAA ACAACGTTGC TCAGGCCGTT ATTGAAGCCC 
GTGGTAAGTA A 



50 
100 
111 



2) INFORMATION FOR SEQ ID NO: 1827 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 108 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shewanella putrefaciens 

(B) STRAIN: ATCC 8071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1827 

GATTTGCGCT CTGCAACTCA TGGGCGTGCT TCGTACTCCA TGGAGTTCTT 50 
GAAGTACTCT GATGCACCGC AAAACATTGC GAAAGCGATT ATTGAATCTC 100 
GTAGCTAA 108 



2) INFORMATION FOR SEQ ID NO: 1828 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Obesumbacterium proteus 

(B) STRAIN: ATCC 12841 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1828 

CTCAGCTGCG TTCTCTGACC AAAGGTCGTG CATCTTACTC CATGGAATTC 50 
CTGAAGTATG ATGATGCGCC TAACAACGTT GCTCAGGCCG TTATTGAAGC 100 
TCGTGGCAAA TAA 113 
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2) INFORMATION FOR SEQ ID NO: 1829 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 

(B) STRAIN: ATCC 13182 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1829 

GCCGCAGGGT TAAAACCAAA GTCCCGTGCT CTCTCCTGAA GGGGAGAGCA 50 
CTATAGTAAG GAATATAGCC 70 



10 



20 



35 



2) INFORMATION FOR SEQ ID NO: 1830 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 66 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

30 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Budvicia aquatica 

(B) STRAIN: ATCC 35567 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1830 

GCCTCGGGTA AAACTTATAT CCCAGTCCCC CTCGTATAGA GGGGGATAGA 50 
GTAAAGGAAG ATAATC 66 



40 



2) INFORMATION FOR SEQ ID NO: 1831 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



50 



(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Plesiomonas shigelloides 

(B) STRAIN: ATCC 14029 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1831 



TCCACAGGAT TAAAACCCAG GTTTAAACCT AAGTCCCGTG CTCTCTCCTC 
AGGGGAGAGC ACAATAGTAA GGAATATAGC C 



50 
81 



2) INFORMATION FOR SEQ ID NO: 1832 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Obesumbacterium proteus 

(B) STRAIN: ATCC 12841 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1832 

GCTACTAGTT TAAAACATTG ATCCCGTGCT CTCTCTATGA AGGGAGAGCA 50 
CAAGAGTAAG GAATAAAGCC 70 



2) INFORMATION FOR SEQ ID NO: 1833 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 72 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shewanella putrefaciens 

(B) STRAIN: ATCC 8071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1833 

TTTCCAGTTA CGACATAAAT GTTATTATGG TCCAGCTTTG ACTGGACTAT 50 
TCTGAAAAGA AAGGAATATA TC 72 



2) INFORMATION FOR SEQ ID NO: 1834 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Buttiauxella agrestis 

(B) STRAIN: ATCC 33320 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1834 

GCCCCGGGTT TTAAAAAACA TTGATCCCGT GCTCTCTCCA GAAGGGGAGA 50 
GCGCAACAGT AAGGAATATA GCC 7 3 



2) INFORMATION FOR SEQ ID NO: 1835 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 795 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear. 

MOLECULE TYPE: Genomic DNA 
ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter coli 

(B) STRAIN: ATCC 43479 

(xi) SEQUENCE DESCRIPTION: SEQ ID, NO: 1835 

CTGCAGCTGA TGGTCCTATG CCACAAACTA GAGAGCACAT CCTTCTATCA 50 

CGCCAAGTAG GTGTTCCATA TATCGTTGTA TTTATGAATA AAGCAGATAT 100 

GGTTGATGAT GCTGAACTTT TAGAATTGGT TGAAATGGAA ATTAGAGAAT 150 

TATTAAGCTC TTATGATTTC CCAGGTGATG ACACACCTAT TATTTCAGGT 200 

TCTGCTTTAA AAGCTCTTGA AGAAGCAAAA GCTGGACAAG ATGGCGAATG 250 

GTCAGCGAAA ATTATGGATC TTATGGCTGC TGTTGATAGC TATATTCCAA 300 

CTCCAACTCG TGACACTGAA AAAGATTTCT TAATGCCAAT TGAAGATGTT 350 

TTCTCAATTT CAGGTCGTGG TACTGTTGTT ACAGGTAGAA TTGAAAAAGG 400 

TATTGTAAAA GTTGGTGATA CTATAGAAAT CGTTGGTATT AAAGATACTC 450 

AAACAACAAC TGTAACTGGC GTTGAAATGT TTAGAAAAGA AATGGACCAA 500 

GGTGAAGCAG GGGATAATGT TGGTGTTCTT CTTCGTGGTA CAAAAAAAGA 550 

AGAAGTTATC CGCGGTATGG TTCTTGCTAA ACCAAAATCA ATTACTCCAC 600 

AT ACT GAT TT CGAAGCTGAA GTTTATATCC TAAATAAAGA TGAGGGTGGT 650 

AGACATACTC CATTCTTTAA TAACTATAGA CCGCAATTCT ATGTAAGAAC 700 

AACAGATGTA ACAGGTTCTA TTAAATTAGC TGATGGCGTT GAAATGGTTA 750 

TGCCTGGTGA AAATGTAAGA ATTACTGTAA GCTTGATTGC ACCAG 7 95 



(ii) 
(vi) 
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2) INFORMATION FOR SEQ ID NO: 1836 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus subsp. fetus 

(B) STRAIN: ATCC 25936 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1836 

GCCATACTAG TTGTTTCTGC AGCTGATGGC CCAATGCCAC AAACTAGAGA 50 

GCACATTTTG CTATCTCGTC AAGTTGGTGT TCCATATATA GTTGTTTTTA 100 

TGAACAAAGC TGATATGGTA GATGACGCAG AGTTGCTAGA ATTAGTTGAA 150 

ATGGAGATCA GAGAGTTATT AAGCGAATAT GACTTCCCTG GT GAT GAT AC 200 

TCCTATTATA AGCGGATCAG CACTTCAAGC TCTTGAAGAA GCTAAAGCTG 250 

GTAATGATGG CGAATGGTCA GCTAAGATTA TGGATCTTAT GGCTGCTGTT 300 

GATAGCTACA TACCAACTCC AGTTCGTGCT ACTGATAAAG ATTTCTTAAT 350 

GCCGATTGAA GACGTATTCT CAATTTCTGG CCGTGGTACT GTTGTTACTG 400 

GTAGAATTGA AAAAGGTATA GTTAAAGTTG GTGATACTAT CGAAATCGTA 450 

GGTATTAGAG ATACACAAAC TACAACAGTT ACCGGCGTTG AAATGTTTAG 500 

AAAAGAAATG GATCAAGGCG AGGCTGGTGA TAACGTTGGT GTTCTTTTAC 550 

GCGGTACAAA GAAAGAAGAC GTTGAAAGAG GTATGGTTCT TTGTAAGCCA 600 

AAATCAATTA CTCCTCATAC TAAATTTGAG GGAGAAGTTT ATATCTTGAC 650 

TAAGGAAGAG GGCGGTAGAC ATACTCCATT CTTCAACAAC TATAGACCAC 700 

AATTTTATGT AAGAACAACA GATGTTACTG GATCAATCAC TCTTCCAGAG 750 

GGTACTGAGA TGGTTATGCC TGGTGATAAC TTAAAAATCA CTGTTGAGTT 800 

AATCAACCCA GTTGCTC 817 



2) INFORMATION FOR SEQ ID NO: 1837 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus, subsp. venerealls 

(B) STRAIN: ATCC 33561 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1837 

CCATACTAGT TGTTTCTGCA GCTGATGGCC CAATGCCACA AACTAGAGAG 50 
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CACATTTTGC 
GAACAAAGCT 
TGGAGATCAG 
CCTATTATAA 
TAATGATGGC 
ATAGCTACAT 
CCGATTGAAG 
TAGAATTGAA 
GTATTAGAGA 
AAAGAAATGG 
CGGTACAAAG 
AATCAATTAC 
AAGGAAGAGG 
ATTTTATGTA 
GTACTGAGAT 



TATCTCGTCA 
GATATGGTAG 
AGAGTTATTA 
GCGGATCAGC 
GAATGGTCAG 
ACCAACTCCA 
ACGTATTCTC 
AAAGGTATAG 
TACACAAACT 
ATCAAGGCGA 
AAAGAAGACG 
TCCTCATACT 
GCGGTAGACA 
AGAACAACAG 
GGTTATGCCT 



AGTTGGTGTT 
ATGACGCAGA 
AGCGAATATG 
ACTTCAAGCT 
CTAAGATTAT 
GTTCGTGCTA 
GATTTCTGGC 
TTAAAGTTGG 
ACAACAGTTA 
GGCTGGTGAT 
TTGAAAGAGG 
AAATTTGAGG 
TACTCCATTC 
ATGTTACTGG 
GGTGATAACT 



CCATATATAG 
GTTGCTAGAA 
ACTTCCCTGG 
CTTGAAGAAG 
GGATCTTATG 
CTGATAAAGA 
CGTGGTACTG 
TGATACTATC 
CCGGCGTTGA 
AACGTTGGTG 
TATGGTTCTT 
GAGAAGTTTA 
TTCAACAACT 
ATCAATCACT 
TAAAAATCAC 



TTGTTTTTAT 
TTAGTTGAAA 
TGATGATACT 
CTAAAGCTGG 
GCTGCTGTTG 
TTTCTTAATG 
TTGTTACTGG 
GAAATCGTAG 
AATGTTTAGA 
TTCTTTTACG 
TGTAAGCCAA 
TATCTTGACT 
ATAGACCACA 
CTTCCAGAGG 
TGTTAGTT 



100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
798 



2) INFORMATION FOR SEQ ID NO: 1838 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1116 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Buttiauxella agrestis 

(B) STRAIN: ATCC 33320 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1838 

GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTCA ACGTCGGTAC 50 

TATCGGCCAC GTTGACCATG GTAAAACTAC TCTGACTGCA GCAATCACTA 100 

CCGTTCTGGC TAAAACCTAC GGCGGTTCTG CACGCGCATT CGACCAGATC 150 

GATAACGCAC CAGAAGAAAA AGCTCGTGGT ATCACCATCA ACACTTCCCA 200 

CGTTGAATAT GACACCCCGA CTCGTCACTA CGCGCACGTT GACTGCCCAG 250 

GGCACGCCGA CTACGTTAAA AACATGATCA CCGGTGCTGC TCAGATGGAC 300 

GGCGCTATCC TGGTTGTTGC TGCGACTGAT GGCCCGATGC CACAGACTCG 350 

TGAGCACATC CTGCTGGGTC GCCAGGTTGG CGTTCCATTC ATGATCGTGT 4 00 

TCATGAACAA ATGTGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 450 

GAAATGGAAG TTCGTGAACT TCTGTCTGCT TATGATTTCC CGGGCGACGA 500 

CATCCCAGTG GTTCGTGGTT CAGCTCTGAA AGCGCTGGAA GGCGAAGCAG 550 

AGTGGGAAGC TAAAATCATC GAGCTGGCTG GTCACCTGGA TAACTACATC 600 

CCAGAACCAG AGCGTGCTAT CGACAAGCCA TTCCTGCTGC CAATCGAAGA 650 

CGTATTCTCC ATCTCCGGCC GTGGTACTGT TGTTACTGGT CGTGTAGAGC 700 

GTGGTATCAT TAAAGTTGGT GAAGAAGTAG AAATCGTTGG TATCAAAGAT 750 

ACCGTGAAAT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 800 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATTAAAC 850 

GTGAAGATAT CGAACGTGGT CAGGTTCTGG CTAAGCCAGG CTCTATCAAG 900 

CCGCACACTC AGTTCGAATC AGAAGTTTAT ATCCTGTCCA AAGATGAAGG 950 
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CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA 
GTACAACTGA CGTGACTGGC ACCATCGAAC 
GTAATGCCGG GCGACAACAT TCAAATGGTT 
CGCAATGGAC GACGGT 



PCT/CAOO/01150 

CCGTCCACAG TTCTACTTCC 1000 
TGCCAGAAGG CGTTGAGATG 1050 
GTTACCCTGA TCCACCCAAT 1100 

1116 



2) INFORMATION FOR SEQ ID NO: 1839 

10 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1109 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 
20 (B) STRAIN: ATCC 13182 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1839 

GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTCA ACGTCGGTAC 50 

25 TATCGGCCAC GTTGACCATG GTAAAACTAC TCTGACCGCT GCAATCACTA 100 

CCGTTCTGGC TAAAACCTAC GGTGGTGCTG CTCGCGCATT CGACCAGATC 150 

GATAACGCGC CGGAAGAAAA AGCTCGTGGT ATCACCATCA ACACTTCCCA 200 

CGTTGAATAT GACACCCCGA CTCGCCACTA CGCGCACGTA GACTGCCCGG 250 

GCCACGCCGA CTATGTTAAA AACATGATCA CCGGTGCTGC GCAGATGGAC 300 

3 0 GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCGATGC CGCAGACTCG 350 

TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCGTAC ATCATCGTGT 400 

TCCTGAACAA GTGCGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTT 450 

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGATTTCC CGGGCGACGA 500 

CACTCCGATC GTTCGTGGTT CTGCTCTGAA AGCTCTGGAA GGCGACGCTG 550 

3 5 AGTGGGAATC TAAAATCATC GAACTGGCTG GCTTCCTGGA TTCTTATATT 600 

CCGGAACCAG AGCGTGCGAT TGACAAGCCG TTCCTGCTGC CGATCGAAGA 650 

CGTATTCTCC ATCTCCGGTC GTGGTACCGT TGTTACCGGT CGTGTAGAGC 700 

GCGGTATCAT CAAAGTTGGC GAAGAAGTTG AAATCGTTGG TATCAAAGAC 750 

ACTGCTAAGT CTACCTGTAC TGGCGTTGAA ATGTTCCGCA AACTGCTGGA 800 

40 CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTATCAAAC 850 

GTGAAGAAAT CGAACGTGGT CAGGTACTGG CTAAGCCGGG CTCTATCAAG 900 

CCGCACACCA AGTTCGAATC TGAAGTTTAT ATCCTGTCCA AAGACGAAGG 950 

CGGCCGTCAC ACTCCGTTCT TCAAAGGCTA CCGTCCGCAG TTCTACTTCC 1000 

GTACAACTGA CGTGACTGGC ACCATCGAAC TGCCGGAAGG CGTAGAGATG 1050 

45 GTTATGCCGG GCGACAACAT CAAAATGGTT GTTACCCTGA TCCACCCGAT 1100 

CGCGATGGA 1109 



50 2) INFORMATION FOR SEQ ID NO: 1840 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1108 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plesiomonas shigelloides 

(B) STRAIN: ATCC 14029 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1840 



GTGTCTAAAG AAAAATTTGA ACGTACTAAA CCGCACGTTA ACGTTGGTAC 50 

TATCGGCCAC GTTGACCACG GTAAAACTAC CCTGACTGCA GCTATCACTA 100 

15 CCGTACTGTC TAAAGTATAC GGTGGTCAGG CTCGTGCATT CGATCAGATC 150 

GATAACGCGC CAGAAGAAAA AGCTCGTGGT ATCACCATCA ACACTTCTCA 200 

CGTAGAGTAC GACACCCCAA CTCGTCACTA CGCGCACGTT GACTGCCCAG 250 

GTCACGCCGA CTACGTGAAG AACATGATCA CTGGTGCTGC TCAGATGGAC 300 

GGCGCTATCC TGGTAGTAGC TGCGACTGAC GGCCCAATGC CTCAGACTCG 350 

20 TGAGCACATC CTGCTGGGTC GTCAGGTAGG CGTTCCTTAC ATCATCGTGT 400 

TCCTGAACAA GTGTGACATG GTTGATGACG AAGAGCTGCT GGAACTGGTA 4 50 

GAAATGGAAG TACGTGAGCT GCTGTCTCAG TACGACTTCC CAGGCGACGA 500 

TACTCCAGTT GTTCGCGGTT CTGCACTGAA AGCGCTGGAA GGCGATGCTC 550 

AGTGGGAAGA GAAGATTGTT GAACTGGCAG GCTACCTGGA CAGCTACATC 600 

25 CCTGAGCCAG AGCGTGCTAT CGACAAGCCA TTCCTGCTGC CAATCGAAGA 650 

CGTATTCTCT ATCTCCGGCC GTGGTACTGT AGTAACTGGT CGTGTAGAGC 700 

GCGGTATCAT CAAAGTTGGT GAAGAAGTAG AAATCGTTGG TATCAAAGAG 7 50 

ACTACCAAGA CTACTTGTAC TGGCGTTGAA ATGTTCCGTA AGCTGCTGGA 800 

CGAAGGTCGT GCGGGCGAGA ACGTAGGTGT TCTGCTGCGT GGTACCAAGC 850 

30 GTGATGACGT AGAGCGTGGT CAGGTTCTGG CTAAGCCAGG CTCAATCAAC 900 

CCACACACCA ACTTTGTAGC AGAAGTTTAT ATTCTGTCCA AAGATGAAGG 950 

TGGTCGTCAC ACTCCATTCT TCAAAGGCTA CCGTCCACAG TTCTACTTCC 1000 

GTACAACTGA CGTGACCGGT ACCATCGAAC TGCCAGAAGG CGTAGAGATG 1050 

GTAATGCCAG GTGACAACAT TCAAATGGTT GTTACCCTGA TTGCACCAAT 1100 

35 CGCGATGG 1108 



2) INFORMATION FOR SEQ ID NO: 1841 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1107 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
15 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: Shewanella putrefaciens 

(B) STRAIN: ATCC 8071 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1841 

961 
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TGGCAAAAGC TAAATTTGAA CGTATTAAGC CTCACGTAAA CGTGGGCACC 50 

ATTGGTCACG TTGACCATGG TAAAACCACT CTGACTGCAG CTATCTCTCA 100 

CGTACTGGCT AAGACCTACG GTGGCGAAGC TAAAGACTTC TCTCAAATCG 150 

ATAACGCTCC AGAAGAGCGT GAGCGCGGTA TTACCATCAA TACCTCTCAC 200 

ATCGAATATG ACACGCCATC ACGCCACTAC GCCCACGTAG ACTGCCCAGG 250 

CCACGCTGAC TATGTTAAAA ACATGATCAC TGGTGCTGCA CAGATGGACG 300 

GCGCGATTCT GGTAGTCGCT TCAACAGACG GTCCAATGCC ACAGACTCGT 350 

GAGCACATCC TGCTTTCTCG TCAGGTTGGC GTACCATTCA TCATCGTATT 400 

CATGAACAAA TGTGACATGG TAGATGACGA AGAGCTGTTA GAGCTAGTTG 4 50 

AGATGGAAGT GCGTGAACTG TTATCAGAAT ACGATTTCCC AGGTGATGAC 500 

TTACCGGTAA TCCAAGGTTC AGCTCTGAAA GCGCTAGAAG GCGAGCCAGA 550 

GTGGGAAGCA AAAATCCTTG AATTAGCAGC GGCGCTGGAT TCTTACATTC 600 

CAGAACCACA ACGTGACATC GATAAGCCGT TCCTACTGCC AATCGAAGAC 650 

GTATTCTCAA TTTCAGGCCG TGGTACAGTA GTAACAGGTC GTGTTGAGCG 700 

TGGTATTGTA CGCGTAGGCG ACGAAGTTGA AATCGTTGGT GTACGTGCGA 750 

CAACTAAGAC AACGTGTACT GGTGTAGAAA TGTTCCGTAA ACTGCTTGAC 8 00 

GAAGGTCGTG CAGGTGAGAA CTGTGGTATT TTGTTACGTG GTACTAAGCG 850 

TGATGACGTA GAACGTGGTC AAGTATTAGC GAAGCCAGGT TCAATCAACC 900 

CACACACTAC TTTTGAATCA GAAGTTTACG TACTGTCAAA AGAAGAAGGT 950 

GGTCGTCACA CGCCATTCTT CAAAGGCTAC CGTCCACAGT TCTACTTCCG 1000 

TACAACTGAC GTAACCGGTA CTATCGAACT GCCAGAAGGC GTAGAGATGG 1050 

TAATGCCAGG CGATAACATC AAGATGGTAG TGACACTGAT TTGCCCAATC 1100 

GCGATGG 1107 



2) INFORMATION FOR SEQ ID NO: 1842 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1116 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Obesumbacterium proteus 

(B) STRAIN: ATCC 12841 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1842 

GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTTA ACGTTGGTAC 50 

AATCGGCCAC GTTGACCACG GTAAAACTAC CCTGACTGCT GCAATCACTA 100 

CCGTTCTGGC TAAAACCTAC GGTGGTTCTG CACGTGCATT CGACCAGATC 150 

GATAACGCGC CAGAAGAAAA AGCTCGTGGT ATCACCATCA ACACTTCTCA 200 

CGTTGAATAT GACACCCCGA CTCGCCACTA CGCACACGTA GACTGCCCAG 250 

GCCACGCCGA CTATGTTAAA AACATGATCA CCGGTGCTGC GCAGATGGAC 300 

GGCGCGATCC TGGTTGTTGC TGCGACTGAC GGCCCTATGC CTCAGACTCG 350 

TGAGCACATC CTGCTGGGTC GTCAGGTTGG CGTTCCTTAC ATCATCGTAT 4 00 

TCCTGAACAA ATGCGACATG GTTGATGATG AAGAGCTGCT GGAGCTGGTA 4 50 

GAAATGGAAG TTCGTGAACT TCTGTCTCAG TACGACTTCC CAGGCAATGA 500 

962 
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TACTCCAATC ATCCGTGGTT CTGCTCTGAA 

AGTGGGAAGC TAAGATCGTA GAACTGGCTG 

CCAGAACCAG AACGTGCTAT CGACAAGCCA 

CGTATTCTCA ATCTCTGGCC GTGGTACTGT 

GCGGTATCGT TAAAGTTGGT GAAGAAGTTG 

ACCGTTAAAT CAACTTGTAC CGGCGTTGAA 

CGAAGGTCGT GCAGGCGAGA ACGTTGGTGT 

GTGAAGACAT CGAACGTGGT CAGGTTCTGG 

CCACACACCA AGTTCGAATC AGAAGTTTAT 

CGGCCGTCAT ACTCCGTTCT TCAAAGGCTA 

GTACAACTGA CGTGACCGGT ACCATCGAAC 

GTAATGCCAG GCGATAACAT CAAAATGATC 
CGCAATGGAC GATGGT 



PCT/CA00/01150 

AGCGCTGGAA GGCGAAGCTG 550 

AAACTCTGGA TTCTTACATC 600 

TTCCTGCTGC CAATCGAAGA 650 

TGTTACCGGT CGTGTAGAGC 700 

AGATTGTTGG TATCAAAGAT 750 

ATGTTCCGTA AACTGCTGGA 800 

TCTGCTGCGT GGTATCAAGC 850 

CTAAACCAGG TTCTATCAAA 900 

ATTCTGAGCA AAGATGAAGG 950 

CCGTCCACAG TTCTACTTCC 1000 

TGCCAGAAGG CGTGGAAATG 1050 

GTTACCCTGA TCCACCCAAT 1100 

1116 



2) INFORMATION FOR SEQ ID NO: 1843 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1129 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE:. 

(A) ORGANISM: Budvicia aquatica 

(B) STRAIN: ATCC 35567 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1843 

GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTTA ACGTTGGTAC 50 

TATCGGCCAC GTTGACCACG GTAAAACAAC TCTGACTGCT GCTATCACCA 100 

GCGTTTTAGC TAAAACTTAT GGCGGTAACG CTCGTGCATT CGATCAAATC 150 

GATAATGCAC CAGAAGAAAA AGCACGTGGT AT C AC CATC A ACACTTGTCA 200 

CGTTGAGTAT GATACTCCTG CTCGCCACTA CGCACACGTA GACTGCCCAG 250 

GACACGCCGA CTATGTGAAA AACATGATCA CCGGTGCTGC TCAAATGGAC 300 

GGCGCGATCT TAGTTGTTGC GGCAACTGAT GGTCCTATGC CACAGACTCG 350 

TGAGCACATC CTGTTAGGTC GTCAGGTTGG CGTTCCTTAC ATCATCGTGT 400 

TCCTGAACAA GTGTGACATG GTTGATGACG AAGAACTGTT AGAATTAGTT 450 

GAAATGGAAG TTCGTGAGCT TCTTTCTGCT TATGATTTCC CTGGTGACGA 500 

TACTCCAGTT GTTCGTGGTT CTGCGCTGAA AGCGTTAGAA GGCGAAGCTG 550 

AGTGGGAAGC TAAGATCATT GAATTAGCCG GATATCTGGA TAGCTACATC 600 

CCAGAGCCAG AGCGTGCGAT TGACCGTCCG TTCCTGCTGC CAATCGAAGA 650 

CGTATTCTCT ATTTCAGGCC GTGGTACAGT TGTTACCGGT CGTGTAGAGC 700 

GCGGAATCGT TAAAGTCGGT GAAGCCGTTG AGATTGTTGG TATCAAAGAT 750 

ACCGTACAAA CTACCTGTAC TGGCGTTGAA ATGTTCCGTA AGTTACTTGA 800 

CGAAGGCCGT GCTGGTGAGA ACGTTGGTGT TCTGCTGCGT GGTACTAAGC 850 

GTGAAGAAAT CGAACGCGGT CAAGTACTGG CTAAGCCAGG TTCAATCAAC 900 

CCGCACACCA ACTTTGTATC AGAAGTTTAT ATTCTGAGCA AAGATGAAGG 950 

TGGTCGTCAT ACTCCATTCT . TCAAAGGCTA CCGTCCACAG TTCTACTTCC 1000 

GTACAACTGA CGTGACCGGT ACTATCGAAC TGCCAGAAGG CGTAGAGATG 1050 

963 
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GTGATGCCTG GTGACAACAT TCAGATGACT GTAACTCTGA TTGCACCAAT J. 100 
CGCGATGGAC GAAGGTTTAC GCTTCGCTA 1129 



2) INFORMATION FOR SEQ ID NO: 1844 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 810 bases 
10 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



15 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia adiacens 

(B) STRAIN: ATCC 49175 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1844 

ATATTCATAA TGCATTACAA GTTGTGAAAA CAACAAGTGA CGGAAGTGAA 50 

AAGACTGTTA CATTGGAAAC TGCTGTAGAA TTAGGGGATG GTGCAGTTCG 100 

TACGATTGCC ATGGAATCTA CAGATGGTTT GCAACGTGGC ATGAAAGTAG 150 

25 TGGACTTAGG ACGCACAATT AGCGTTCCTG TGGGACCTGA AACATTAGGT 200 

CGTGTATTCA ACGTTTTAGG AGATACAATC GACTTGAAAG AACCATTCCC 250 

AGAAGACTTT ACAAGACATG AAATCCATAA ACCAGCACCA AAATTTGAAG 300 

AATTAAACAG TCAATATGAA ATTCTACAAA CAGGGATTAA AGTTATTGAC 350 

CTTTTAGCAC CTTATCTTAA AGGTGGTAAA ATCGGTTTAT TCGGTGGTGC 400 

30 CGGTGTAGGG AAAACCGTAT TAATTCAAGA ATTAATTCAT AATATCGCTG 450 

AAGAACTTGG TGGTATTTCA GTATTTACAG GGGTAGGGGA ACGTACTCGT 500 

GAAGGGAATG ACCTTTACCA TGAAATGCAA GAATCAGGCG TATCTGCTAA 550 

AACAGCGATG GTGTTTGGGC AAATGAACGA ACCACCAGGA GCTCGTATGC 600 

GTGTAGCACT AACAGGGTTA ACTATTGCGG AATACTTCCG TGATATGGAA 650 

35 AAACAAGACG TGCTTTTATT CATCGATAAC ATTTATCGTT TCACGCAAGC 700 

AGGTTCAGAA GTGTCAGCGT TACTTGGTCG TATGCCTTCT GCCGTAGGGT 750 

ATCAACCAAC ATTAGCGACA GAAATGGGTC AATTACAAGA ACGTATCAGT 800 

TCAACTAAAG 810 



40 



2) INFORMATION FOR SEQ ID NO: 1845 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 815 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Arcanojbacteriu/n haemolyticum 

964 
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(B) STRAIN: ATCC 9345 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1845 

5 TGCTCTGCCA GAAATTAACA ACGCCCTCCT CACCGAAGTA GACCTCTCCG 50 

GCCAAGGCGA AGGCGAAAGC GTTCTCAAGA TGACTCTTGA GGTTGCTCAG 100 

CACCTCGGCG ATAACATCGT CCGTACCATC GCCATGAAGC CAACCGACGG 150 

TCTGGTTCGC GGCGCCACCG TTATCGATAC CGGCGCCCCA ATCACCGTGC 200 

CAGTTGGCGA CGCAACTAAA GGTCATGTTT TCAACGTGAC CGGTGATGTC 250 

10 CTAAACTTGG GCGAAGGCGA AACCCTTGAC GTCAAGGAAC GGTGGCCAAT 300 

CCACCGCAAG GCTCCACAGT TCGACGAACT CGAACCGGAA ACCAAGATGT 350 

TCGAAACAGG CATCAAGGTG ATCGATCTCC TCACCCCATA CGTACAGGGC 4 00 

GGCAAGATCG GTCTGTTTGG CGGTGCTGGT GTTGGTAAGA CCGTTCTTAT 4 50 

CCAGGAAATG ATCCAGCGTG TTGCACAGGA TCATGGCGGT GTGTCCGTGT 500 

15 TCGCGGGTGT GGGTGAACGT ACCCGTGAAG GTAACGATCT TATCCACGAA 550 

ATGGAAGATG CGGGCGTTCT TGATAAGACC GCGCTTGTGT TCGGCCAGAT 600 

GGATGAACCG CCAGGGGTTC GTTTGCGTAT TGCACTTTCC GGCCTGACCA 650 

TGGCGGAATA CTTCCGTGAC GTGCAAAACC AGGACGTGCT TTTGTTCATC 700 

GATAACATCT TCCGCTTCAC CCAGGCAGGT TCGGAAGTGT CCACGTTGCT 750 

2 0 TGGCCGTATG CCATCAGCAG TGGGCTACCA GCCGACCTTG GCAGATKAAA 800 

TGGGCGCATT GCAGG 815 



25 2) INFORMATION FOR SEQ ID NO: 1846 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1073 bases 

(B) TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

35 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Basidiobolus r ana rum 

(B) STRAIN: ATCC 24670 



40 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1846 



ACCTTCCTCC TATCTTGAAH GCCCTGGAGG TCCAAAACCA CAGCTCTCGY 50 

TTAGTTTTAG AGGTGTCCCA GCATTTGGGT GAAAACACCG TTCGTACTAT 100 

TGCTATGGAC GGTACTGAAG GATTGGTTCG TGGTCAAAAT GTCGTAGATA 150 

CCGGATATCC TATTAGAGTT CCTGTCGGTC CTGAATGTTT GGGTCGTATC 200 

45 ATGAACGTTA TTGGCGAGCC TGTTGATGAG CGCGGYCCTA TCAAGACCAA 250 

GAAGCTTGCA CCCATCCACG CTTCTCCCCC CGAGTTCGTY GACCAATCCA 300 

CCACCCCCGA AATCTTGGAG ACTGGTATTA AGGTTGTCGA TTTGTTGGCC 350 

CCTTACGCTC GTGGTGGTAA GATCGGTCTT TTCGGTGGTG CCGGTGTCGG 400 

TAAGACTGTG TTTATCCAGG AGTTGATYAA CAACGTTGCC AAGGCCCACG 450 

50 GTGGTTACTC CGTGTTCGCT GGTGTTGGTG AGCGTACTCG TGAGGGTAAC 500 

GATTTGTACC AC GAG AT GAT TCAAACTGGT GTCATCAAGC TTGATGGCCA 550 

ATCCAAGGCT GCCCTTGTCT ACGGMCAAAT GAACGAGCCC CCAGGTGCTC 600 

GTGCCCGMGT CGCTTTGACC GGTCTTACCG TTGCTGAATA CTTCCGTGAT 650 
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GAGGAAGGHC AAGATGTGTT GCTCTTCATT 
CCAAGCTGGT TCTGAAGTGT CYGCCTTGTT 
TCGGTTACCA ACCCACCTTG GCCACCGATA 
ATTACCACCA CCAAGAAGGG TTCCATTACC 
5 CCCTGCTGAT GATTTGACCG ATCCCGCTCC 
TTGACGCCAC CACCGTGTTG TCTCGTTCCA 
CCCGCTGTCG ATCCCCTCGA CTCCAAGTCT 
■ TGTCGGTGAA GAGCACTACG ACATCGCCAC 
ARTCTTACAA GTCTCTCCAG GAT 
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GACAACATTT TCCGTTTCAC 700 

GGGTCGTATC CCCTCCGCTG 750 

TGGGTGTCAT GCAAGAGCGT 800 

TCTGTCCAGG CCATTTACGT 850 

TGCCACTACT TTTGCCCATC 900 

TCTCTGAGTT GGGTATTTAC 950 

CGTATGTTGG ATCCYCGTAT 1000 

TGGTGTTCAG AAGATTCTCC 1050 

1073 



2) INFORMATION FOR SEQ ID NO: 1847 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



20 



40 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Blastomyces dermatitidis 
25 (B) STRAIN: ATCC 56220 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1847 

TGTCTTTATC CAGGAGTTGA TTGTACGTCT TGCCTCACCC TTTGGGTATT 50 

30 TTGCGAATAC TAATTATAGT AGAACAACAT TGCCAAGGCT CACGGTGGTT 100 

ACTCTGTCTT CACTGGTGTC GGTGAACGTA CTCGTGAGGG TAACGATTTG 150 

TACCACGAAA TGCAGGAAAC TGGTGTCATT CAGCTCGAGG GTGAATCCAA 200 

GGTCGCCCTC GTGTTCGGTC AGATGAACGA GCCCCCTGGT GCCCGTGCCC 250 

GTGTCGCTCT TACTGGTTTG ACCATTGCCG AGTACTTCCG TGACGAGGAG 300 

35 GGTCAAGATG TGCTTCTCTT CATTGACAAC ATTTTCCGTT TCACTCAGGC 350 

CGGTTCTGAG GTGTCTGCCC TTTTGGGTCG TATCCCCTCT GCCGTCGGTT 400 

ACCAGCCCAC TCTCGCCGTC GACATGGGTG TCATGCAGGA GCGTATTACC 450 

ACCACCACCA AGGGTTCCAT CACCTCCGTC 480 



2) INFORMATION FOR SEQ ID NO: 1848 



(i) SEQUENCE CHARACTERISTICS: 
45 (A) LENGTH: 566 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(.A) ORGANISM: Blastomyces dermatitidis 
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(B) STRAIN: ATCC 14112 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1848 

TGTCTTCATT CAGGAGTTGA TTGTACGTCC CTTCCTCTCT ACAAATGACG 50 

GGCGAGGAAA ATTTTTGGCT TTTTCTAATA GCTCGTTATA GAACAACATT 100 

GCCAAAGCCC ACGGTGGTTA CTCCGTTTTC ACTGGTGTCG GCGAGCGGAC 150 

CCGTGAAGGA AACGATTTGT ACCACGAGAT GCAGGAAACC CGTGTTATCC 200 

AGCTCGATGG CGAGTCTAAG GTCGCACTCG TCTTCGGTCA GATGAACGAG 250 

CCCCCCGGAG CCCGTGCCCG TGTTGCCCTC ACTGGCCTGA CCATTGCTGA 300 

ATATTTCCGT GACGAGGAAG GTCAAGACGG TATGTATTCA TATAAATTAC 350 

TCCGGGCAAA TTGACTCAGA ACCGCACTCA CTCACACATA TATTAGTGCT 400 

TCTCTTTATC GACAACATTT TCCGCTTCAC CCAGGCCGGT TCCGAAGTGT 450 

CCGCCCTGCT TGGTCGTATT CCCTCCGCCG TCGGTTACCA ACCCACTCTC 500 

GCCGTCGACA TGGGTGGTAT GCAGGAACGT ATCACAACCA CCACCAAGGG 550 

CTCCATTACC TYCGTG 566 



2) INFORMATION FOR SEQ ID NO: 1849 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
. (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter coli 

(B) STRAIN: ATCC 43479 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1849 

AATGAAGCCA TTGTTGTAAA TTTTGAAAGT GAAGGCAAAA AACAAAAACT 50 

TGTTTTAGAA GTAGCAGCAC ACTTGGGCGA TAATAGAGTT AGAACTATTG 100 

CTATGGATAT GACAGATGGC TTGGTAAGAG GACTTAAAGC AGAAGCTTTG 150 

GGTGCTCCTA TTAGCGTTCC TGTGGGTGAA AAAGTTTTAG GAAGAATTTT 200 

TAATGTTACG GGAGATTTGA TCGATGAAGG TGAAGAAATT TCTTTTGATA 250 

AAAAATGGGC AATTCATAGA GATCCACCAG CTTTTGAAGA TCAAAGCACA 300 

AAAAGTGAGA TTTTTGAAAC AGGGATTAAA GTTGTGGATT TACTTGCTCC 350 

TTATGCAAAA GGTGGTAAAG TAGGTCTTTT TGGTGGTGCA GGTGTTGGTA 400 

AAACTGTTAT TATTATGGAG CTTATTCACA ATGTTGCATT TAAACATAGC 450 

GGCTATTCTG TATTTGCAGG TGTAGGTGAG AGAACTCGTG AAGGAAATGA 500 

CCTTTATAAT GAAATGAAAG AAAGTAATGT TTTAGACAAA GTTGCTCTAT 550 

GTTATGGACA AATGAATGAA CCACCAGGGG CAAGAAATCG TATTGCTTTA 600 

ACAGGTTTAA CAATGGCTGA GTATTTTAGA GATGAAATGG GTCTTGATGT 650 

GCTTATGTTT ATTGATAATA TCTTTAGATT TTCACAATCA GGTTCTGAAA 7 00 

TGTCAGCACT TTTAGGAAGA ATTCCATCAG CTGTGGGTTA TCAACCAACC 750 

CTAGCAAGTG AAATGGGTAA ATTCCAAGAA AGAATTACTT CAACTAAAAA 800 

AGGATCAATT ACTTCAG 817 
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2) INFORMATION FOR SEQ ID NO: 1850 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus subsp. fetus 

(B) STRAIN: ATCC 25936 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1850 

AAGGCAACAC GCATAAACTT ATTTTAGAGA CTGCTGCACA CCTTGGAGAT 50 

AATCGTGTAA GAACTATCGC TATGGATATG AGCGAAGGAC TTACAAGAGG 100 

GTTAGATGCT ATAGCGCTTG GGTCGCCTAT CAGTGTTCCT GTTGGAGAAA 150 

AAGTTTTAGG AAGAATATTC AACGTAATTG GTGATCTTAT AGACGAAGGC 200 

GAAGAAGAAA AATTTGATAA AAAATGGTCG ATTCATAGAG ATCCGCCGGC 250 

ATTTGAAGAT CAAAGCACAA AAAGTGAAAT TTTTGAAACA GGTATAAAAG 300 

TCGTAGATCT TTTGGCTCCT TATGCAAAAG GCGGTAAAGT TGGACTATTT 350 

GGCGGTGCCG GCGTTGGTAA AACAGTTATC ATTATGGAAC TTATCCACAA 400 

CGTTGCATTC AAACACAGCG GCTATTCGGT ATTTGCCGGT GTCGGTGAAA 450 

GAACAAGAGA GGGTAACGAT CTTTATAATG AAATGAAAGA ATCCGGCGTT 500 

TTGGATAAAG TTGCCTTATG TTATGGACAA ATGAATGAAC CGCCGGGTGC 550 

AAGAAACCGT ATAGCGCTTA CTGGTCTTAC AATGGCTGAG TATTTTCGTG 600 

ACGAGATGGG ACTAGATGTT CTTATGTTTA TCGATAACAT CTTCCGTTTC 650 

TCACAATCAG GCTCAGAGAT GTCGGCTCTT CTTGGACGTA TCCCAAGTGC 700 

GGTTGGTTAT CAACCAACGT TAGCTAGCGA AATGGGAAGA CTTCAAGAAA 7 50 

GAATCACATC AACTAAAAAA GGTTC 775 



2) INFORMATION FOR SEQ ID NO: 1851 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 793 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus subsp. venerealis 

(B) STRAIN: ATCC 33561 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1851 

CGAAGCTATT GAAGTAAATT TTACAGTAGA AGGCAACACG CATAAACTTA 50 
TTTTAGAGAC TGCTGCACAC CTTGGAGATA ATCGTGTAAG AACTATCGCT 100 
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ATGGATATGA GCGAAGGACT TACAAGAGGG 
GTCGCCTATC AGTGTTCCTG TTGGAGAAAA 
ACGTAATTGG TGATCTTATA GACGAAGGCG 
AAATGGTCGA TTCATAGAGA TCCGCCGGCA 
5 AAGTGAAATT TTTGAAACAG GTATAAAAGT 
ATGCAAAAGG CGGTAAAGTT GGACTATTTG 
ACAGTTATCA TTATGGAACT TATCCACAAC 
CTATTCGGTA TTTGCCGGTG TCGGTGAAAG 
TTTATAATGA AATGAAAGAA TCCGGCGTTT 
10 TATGGACAAA TGAATGAACC GCCGGGTGCA 
TGGTCTTACA ATGGCTGAGT ATTTTCGTGA 
TTATGTTTAT CGATAACATC TTCCGTTTCT 
TCGGCTCTTC TTGGACGTAT CCCAAGTGCG 
AGCTAGCGAA ATGGGAAGAC TTCAAGAAAG 

15 



TTAGATGCTA TAGCGCTTGG 150 

AGTTTTAGGA AGAATATTCA 200 

AAGAAGAAAA ATTTGATAAA 250 

TTTGAAGATC AAAGCACAAA 300 

CGTAGATCTT TTGGCTCCTT 350 

GCGGTGCCGG CGTTGGTAAA 400 

GTTGCATTCA AACACAGCGG 450 

AACAAGAGAG GGTAACGATC 500 

TGGATAAAGT TGCCTTATGT 550 

AGAAACCGTA TAGCGCTTAC 600 

CGAGATGGGA CTAGATGTTC 650 

CACAATCAGG CTCAGAGATG 700 

GTTGGTTATC AACCAACGTT 750 

AATCACATCA ACT 7 93 



2) INFORMATION FOR SEQ ID NO: 1852 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter gracilis 
30 (B) STRAIN: ATCC 33236 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1852 

GGACTATTTA CCGAAGATTA ACGAAGCTAT CGAGGTTAAA TTTGACGTCG 50 

3 5 AGGGCGCTCA TCGCAGGCTG ATCCTAGAGG TAGCCGCGCA' CCTTGGAGAC 100 

AATCGCGTCC GCACGATCGC TATGGATATG AGCGATGGAC TTAGGCGAGG 150 

GCTTGAGGCC GTCGCTTTGG GCGCGCCTAT TACGGTGCCT GTGGGCGAGA 200 

AAGTTTTGGG TAGAATTTTT AATGTTACGG GCGATCTGAT CGACGAAGGC 250 

GAGGATGAAA AATTTGAAAC CCGCTGGTCG ATCC AC AG AG ATCCGCCTAG 300 

4 0 CTTTGAAAAT CAAAGCACGA AGAGTGAAAT TTTTGAAACC GGCATTAAGG 350 

TAGTCGATCT GCTCGCCCCT TATGCAAAGG GCGGTAAGGT AGGACTATTC 400 

GGCGGTGCTG GCGTCGGTAA GACCGTCATC ATCATGGAAC TGATTCACAA 450 

CGTCGCTTTC AAACACAGCG GCTACTCCGT ATTTGCGGGT GTCGGCGAGC 500 

GAACGAGAGA GGGAAACGAC CTTTATAACG AGATGAAAGA ATCGGGCGTT 550 

45 TTGGATAAAG TCGCCTTGAC CTATGGTCAG ATGAACGAAC CGCCGGGAGC 600 

GAGAAACCGT ATCGCGCTAA CCGGTCTTAC GATGGCCGAG TATTTCCGCG 650 

ACGAGCTAGG GCTTGACGTT TTGATGTTTA TTGATAATAT CTTCCGCTTC 7 00 

TCGCAGTCGG GTTCGGAGAT GTCCGCGCTT TTAGGACGAA TTCCGTCCGC 7 50 

GGTCGGTTAT CAGCCTACGC TTGCCAGCGA AATGGGTAAA TTACAGGAGC 800 

50 GCATTACTTC TACTAAGAAG GGCTC 825 
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2) INFORMATION FOR SEQ ID NO: 1853 



10 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni subsp. jejuni 

(B) STRAIN: ATCC 33560 



15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1853 

TTTACCTCAA ATTAATGAAG CAATTGTTGT AAATTTTGAA AGCGAAGGAA 50 

AAAAACATAA ACTTGTTTTA GAAGTAGCAG CTCATTTAGG AGATAATAGA 100 

GTTAGAACTA TTGCTATGGA TATGACAGAT GGTTTGGTAA GGGGCTTAAA 150 

20 AGCTGAGGCT TTAGGTGCTC CTATTAGTGT TCCTGTTGGT GAGAAAGTTT 200 

TAGGAAGAAT TTTCAATGTT ACTGGAGATT TGATCGATGA AGGTGAAGAA 250 

ATTTCTTTTG ATAAAAAATG GGCAATTCAT AGAGATCCGC CAGCTTTTGA 300 

AGATCAAAGC ACAAAAAGTG AGATTTTTGA AACAGGGATT AAAGTTGTAG 350 

ATTTGCTTGC TCCTTATGCA AAAGGTGGTA AAGTAGGTCT TTTTGGTGGT 400 

2 5 GCAGGTGTTG GTAAAACTGT TAT TAT TAT G GAGCTTATTC ACAATGTTGC 4 50 

ATTTAAGCAT AGCGGCTATT CTGTATTTGC AGGTGTGGGT GAGAGAACTC 500 

GTGAAGGAAA TGACCTTTAT AATGAAATGA AAGAAAGTAA TGTTTTAGAC 550 

AAAGTTGCTC TATGTTATGG ACAAATGAAT GAACCACCAG GAGCAAGAAA 600 

TCGTATTGCT TTAACAGGTT TAACAATGGC TGAGTATTTT AG AG AT G AAA 650 

3 0 TGGGTCTTGA TGTGCTTATG TTTATTGATA ATATCTTTAG ATTTTCACAA 700 

TCAGGTTCTG AAATGTCAGC ACTTTTAGGA AGAATTCCAT CAGCTGTGGG 750 

TTATCAACCA ACCCTAGCAA GTGAAATGGG TAAATTCCAA GAAAGAATTA 800 

CTTCAACTAA AAAAGGCT 818 



35 



50 



2) INFORMATION FOR SEQ ID NO: 1854 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 830 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus cecorum 

(B) STRAIN: ATCC 43198 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1854 



ATTACCTGAT ATCAACAACG CCTTATTGGT CTATAAAAAT GATGAACAAA 50 
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AAAGTAAAAT TGTGCTAGAA GCTGCCTTAG 
CGTACAATTG CCATGGAATC AACGGATGGT 
TGTCGATACT GGTAAACCAA TTTCAGTTCC 
GACGTGTCTT TAACGTTTTA GGGGATACGA 
5 GCACAAGATG CAGATCGTTC TGCAATTCAT 
AGACTTAAGT ACAAGTACTG AAATTTTAGA 
ACTTATTAGC ACCATATTTA AAAGGTGGTA 
GCCGGAGTAG GTAAAACCGT TTTAATCCAA 
ACAAGAACAT GGTGGGATTT CTGTATTTAC 

10 GTGAAGGAAA TGACTTGTAT CATGAAATGC 
AAAACTGCCA TGGTGTTTGG TCAAATGAAC 
GCGTGTTGCT TTAACTGGGT TAACGATTGC 
AAGGACAAGA TGTGTTGCTA TTTATTGATA 
GCGGGTTCTG AAGTATCAGC CTTGCTTGGT 

15 TTATCAACCT ACATTGGCTA CAGAAATGGG 
CTTCAACTAA GAAGGGCTCT ATCACTTCTA 



AATTAGGTGA TGGCATCATT 100 

TTACAACGTG GGATGGAAGT 150 

AGTTGGTAAA GAAACGCTAG 200 

TTGATATGCA AGAACCATTT 250 

AAAGCTGCAC CAAAATTTGA 300 

AACAGGGATT AAAGTTATCG 350 

AAGTCGGTCT ATTCGGGGGT 400 

GAATTAATCC ATAATATTGC 450 

CGGTGTTGGT GAACGTACAC 500 

GTGATTCAGG AGTTATTGAA 550 

GAACCACCTG GAGCTCGTAT 600 

TGAATATTTC CGTGATGTAG 650 

ACATCTTCCG TTTCACTCAA 700 

CGTATGCCAT CTGCCGTGGG 7 50 

TCAATTACAA GAACGTATCA 800 

830 



20 2) INFORMATION FOR SEQ ID NO: 1855 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 823 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus columbae 

(B) STRAIN: ATCC 51263 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1855 



TCTTTACCAG ATATCAATAA TGCGCTTATT GTCTATAAAA ATGATGAACA 50 

AAAAAGTAAA ATCGTGCTTG AAGCTGCTTT AGAGCTAGGA GATGGCATTA 100 

TTCGTACGAT TGCAATGGAA TCAACTGATG GATTGCAACG TGGAATGGAA 150 

GTTTTCGATA CAGGTAAGCC AATTTCAGTA CCAGTAGGTC GTGAAACATT 200 

4 0 AGGTCGTGTA TTTAATGTTT TAGGTGATAC CATTGATACG CAAGAAGCTT 250 

TTCCTGCTGA TGCGAATCGT GATGCGATTC ATAAATCAGC TCCAGCTTTT 300 

GAAGAATTAA GTACAAGTAC TGAAATCCTA GAAACAGGGA TTAAAGTTAT 350 

CGACTTACTA GCACCATACT TAAAAGGTGG GAAAGTTGGT CTATTCGGTG 4 00 

GTGCCGGTGT AGGTAAAACC GTATTAATTC AAGAATTAAT TCATAATATC 450 

4 5 GCCCAAGAAC ATGGGGGTAT TTCAGTATTT ACCGGTGTTG GTGAACGTAC 500 

ACGTGAAGGA AATGACTTGT ATCACGAAAT GCGTGATTCA GGCGTTATCG 550 

AAAAAACTGC TATGGTGTTT GGGCAAATGA ACGAACCACC TGGAGCACGT 600 

ATGCGTGTTG CGCTAACTGG ACTAACTATT GCGGAATACT TCCGTGATGT 650 

TGAAGGCCAA GACGTATTGC TATTTATTGA TAATATCTTC CGTTTTACTC 700 

50 AAGCAGGTTC TGAAGTTTCT GCCTTACTTG GTCGTATGCC TTCTGCGGTA 7 50 

GGTTATCAAC CTACTTTGGC TACTGAAATG GGTCAATTGC AAGAACGGAT 800 

TACATCAACG AAGAAAGGTT CGA 823 
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2) INFORMATION FOR SEQ ID NO: 1856 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 826 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus dispar 

(B) STRAIN: ATCC 51266 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1856 

TTACCAGACA TTAATAATGC CTTGGTTGTC TATAAAAATG ACGAACAAAA 50 

AACCAAGATT GTATTAGAAG CTGCCTTAGA ACTAGGAGAT GGTGTGATTC 100 

GAACTATCGC CATGGAATCT ACTGATGGCT TACAACGGGG AATGGAAGTT 150 

GTCGATACTG GCAGTTCCAT TTCTGTACCG GTAGGAAAAG AAACATTGGG 200 

TCGTGTATTT AACGTTTTAG GAAATACAAT TGACTTAGAA GAACCTTTTC 250 

CAGCGGATGC TAAACGTAGT GGTATCCATA AAAAAGCGCC TGATTTTGAT 300 

GAATTAAGCA CTAGTACAGA' AATTTTAGAA ACAGGGATTA AAGTTATTGA 350 

CCTATTAGCC CCTTATTTAA AAGGTGGTAA AGTCGGATTA TTCGGTGGTG 4 00 

CCGGAGTTGG TAAAACCGTT TTAATTCAAG AATTAATTCA TAATATTGCC 450 

CAAGAACATG GTGGGATTTC TGTTTTTACT GGTGTTGGTG AAAGAACACG 500 

TGAAGGTAAT GACTTGTATT ATGAAATGAA AGAATCTGGC GTTATCGAAA 550 

AAACTGCCAT GGTATTTGGT CAAATGAATG AGCCACCTGG TGCCCGGATG 600 

CGGGTTGCTT TAACCGGACT TACCATTGCG GAATACTTCC GGGACGTTGA 650 

AGGACAAGAT GTATTGCTCT TTATCGATAA TATTTTCCGT TTTACCCAAG 700 

CTGGTTCAGA AGTATCTGCC TTATTAGGAC GGATGCCCTC TGCCGTTGGT 750 

TATCAACCAA CTTTGGCTAC TGAAATGGGA CAACTTCAAG AACGGATTAC 800 

CTCAACGAAA AAAGGTTCTA TTACAT 826 



2) INFORMATION FOR SEQ ID NO: 1857 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus malodoratus 

(B) STRAIN: ATCC 43197 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1857 

TCCTTACCAG ACATCAACAA TGCGTTGATT GTTTACAAAA AAAATAAAAC 50 
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CGATCTCTAT GGAATCAACA GATGGCTTGC 
GATACAGGCA AACCAATCTC AGTTCCCGTT 
TGTGTTTAAC GTATTAGGTG AAACAATCGA 
5 AAGATGCAGT AAAAAGCGGT ATTCATAAAA 
CTTAGTACCA GTAATGAAAT TTTAGAAACA 
ATTAGCTCCT TACTTAAAGG GTGGTAAAGT 
GTGTTGGTAA AACCGTCTTG ATCCAAGAAT 
GAACACGGTG GTATTTCAGT GTTTACGGGT 

10 AGGGAACGAC CTTTATTATG AAATGAAGGA 
CTGCCATGGT GTTTGGACAA ATGAACGAGC 
GTTGCCTTGA CTGGTTTGAC ATTGGCTGAA 
ACAAGATGTG CTGTTGTTTA TCGACAACAT 
GTTCTGAAGT TTCTGCCTTG CTTGGCCGGA 

15 CAACCAACTT TGGCAACTGA AATGGGTCAA 
AACGAAGAAG GGCT 



PCT/CA00/01150 

TGGTGATGGT GTTATCCGCA 100 

AACGTGGAAT GGAAGTTGTC 150 

GGTAAAGAAA CTTTAGGTCG 200 

CAAAGAAGCG CCTTTTCCAG 250 

AAGCGCCGGC TTTTGAAGAA 300 

GGGATCAAAG TTATCGACTT 350 

CGGACTATTT GGTGGTGCCG 400 

TGATTCATAA TATCGCCCAA 450 

GTTGGTGAAC GTACTCGTGA 500 

ATCAGGCGTT ATTGAGAAAA 550 

CGCCAGGTGC GCGTATGCGT 600 

TATTTCCGAG ATGAAGAAGG 650 

CTTCCGTTTC ACTCAAGCCG 700 

TGCCTTCAGC CGTTGGCTAC 750 

TTGCAAGAAC GAATCACTTC 800 

814 



20 2)TNF0RMATI0N FOR SEQ ID NO: 1858 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 791 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

3 0 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus mundtii 

(B) STRAIN: ATCC 43186 



35 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1858 



CGCATTAGTT GTTTATAAAA ATGATGAGCA AAAATCAAAA GTTGTTCTTG 50 

AAGCAGCATT AGAATTAGGT GACGGTGTGA TCCGTACGAT CGCAATGGAA 100 

TCGACGGATG GACTACAACG TGGAATGGAA GTCATCGACA CAAGCAAAGC 150 

GATCTCTGTA CCAGTTGGAA CAGAAACATT AGGTCGTGTG TTCAACGTGT 200 

40 TAGGTGAAAC AATCGATTTG GAAGCACCAT TTCCAGAGGA TGCCCAAAGA 250 

AGCGAGATCC ACAAGAAAGC ACCAAATTTT GATGAATTAA GCACAAGTAC 300 

AGAGATTCTT GAAACTGGGA TCAAAGTCAT TGACTTATTA GCACCTTATT 350 

TAAAAGGTGG GAAAGTTGGA TTGTTTGGGG GTGCCGGTGT TGGTAAAACC 4 00 

GTACTGATCC AAGAATTGAT CCATAATATC GCCCAAGAAC ATGGGGGAAT 450 

45 CTCAGTGTTT ACCGGTGTAG GGGAACGTAC CCGTGAAGGA AACGATCTGT 500 

ATTACGAAAT GAAAGATTCA GGCGTAATCG AAAAAACAGC GATGGTGTTT 550 

GGACAAATGA ATGAGCCACC AGGTGCTCGT ATGCGTGTCG CACTAACTGG 600 

ATTGACGATT GCGGAATATT TCCGTGATGT CGAAGGACAA GACGTGCTCT 650 

TATTTATTGA TAATATTTTC CGTTTCACCC AAGCAGGTTC AGAAGTATCT 700 

50 GCCTTACTAG GACGTATGCC ATCAGCGGTT GGTTATCAAC CAACCTTAGC 750 

GACTGAAATG GGACAACTCC AAGAACGGAT CACTTCAACG A 7 91 
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2) INFORMATION FOR SEQ ID NO: 1859 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus raffinosus 

(B) STRAIN: ATCC 49427 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1859 

TCCTTACCAG ACATCAACAA TGCGTTGATT GTTTATAAAA AAGATAAAAC 50 

AAAAGTTGTT CTTGAAGCTG CTTTGGAACT TGGTGATGGT GTTATTCGCA 100 

CAATCGCCAT GGAATCAACG GATGGATTAC AACGTGGAAT GGAAGTTGTC 150 

GATACTGGCA AGCCTATTTC TGTTCCAGTA GGAAAAGAAA CTCTAGGTCG 200 

TGTATTTAAT GTATTAGGTG AAACAATCGA CAAGGAAGCG CCTTTTCCAG 250 

AAGATGCAGA AAAAAGTGGT ATTCACAAGA AAGCACCAAC TTTCGAAGAA 300 

CTTAGCACAA GTAATGAGAT CTTAGAAACA GGAATCAAAG TTATTGACTT 350 

GTTAGCTCCT TACTTAAAAG GTGGTAAAGT TGGATTATTT GGTGGTGCCG 400 

GTGTTGGTAA AACAGTCTTG ATTCAAGAGC TAATTCATAA TATCGCTCAA 450 

GAACATGGTG GTATTTCTGT GTTTACTGGT GTTGGTGAAC GTACTCGTGA 500 

AGGGAACGAC CTTTATTATG AAATGAAAGA TTCTGGTGTT ATTGAGAAAA 550 

CTGCTATGGT GTTCGGTCAA ATGAACGAGC CGCCAGGTGC ACGTATGCGT 600 

GTTGCCTTAA CTGGTTTAAC CTTAGCCGAA TACTTCCGTG ATGAAGAAGG 650 

ACAAGATGTG TTGCTATTTA TTGACAACAT TTTCCGTTTC ACTCAAGCCG 700 

GATCAGAAGT TTCTGCCTTA CTTGGCCGTA TGCCGTCAGC AGTTGGTTAC 750 

CAACCGACTT TAGCAACTGA AATGGGTCAA TTACAAGAAC GTATTACGTC 800 

GACGAAAAAA GGTTCAA 817 



2) INFORMATION FOR SEQ ID NO: 1860 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Globicatella sanguis 

(B) STRAIN: ATCC 51173 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1860 

CCTGACATTC ATAATGCATT AATTGTAACG AACGCTGATA TGGCGGATGT 50 
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AATGCAAGAA AATATTTCGG ATGAAGAAAA ATTATTAACC TTAGAAGTTG 100 

CACTGGATTT AGGTCATGGA ATGGTCCGGA CAATTGCGAT GGAATCAACC 150 

GATGGTTTGG AACGCGGCAT GACAGTTGTG GATTATTTAA CACCGATTAA 200 

AGTGCCAGTA GGCGAAGCCA CTTTAGGTAG AGTATTCAAT GTTTTAGGTG 250 

5 AGACAATTGA TGAACTAGAA CCGGTTGGCG ACGATGTTGA ACTCAAAAGT 300 

ATTCATCGTG AAGCCCCTAA ATATGAGGAC TTAGATAATA GTTTTCATGT 350 

TTTAGAAACC GGAATTAAGG TCATCGATTT ATTAGCTCCT TATATTAAAG 400 

GGGGAAAAAT CGGTTTATTC GGTGGTGCCG GAGTGGGTAA AACGGTCTTA 4 50 

ATTCAAGAAT TAATTCATAA TATTGCAGAA CAATTAGGAG GTATCTCAGT 500 

10 TTTCACTGGG GTTGGAGAAC GTACCCGTGA AGGGAATGAC CTCGTTTTTG 550 

AAATGCGAGA GTCAGGTGTA AGCAAGAAGA CGGCCATGGT TTTCGGTCAA 600 

ATGAATGAAC CACCTGGAGC ACGTATGCGT GTTGTCTTAA CAGGACTTAC 650 

AATGGCGGAA TATTTCCGTG ACGAATTGAA ACAAGACGTC TTATTATTTA 7 00 

TTGATAATAT TTATCGTTTT ACTCAAGCAG GTTCCGAAGT GTCAGCCTTA 750 

15 TTAGGTCGTA TGCCTTCAGC AGTAGGGTAT CAACCAACTT TAGCAAGTGA 800 

AATGGGACAA ATGCAAGAAC GTATTACGTC WACGAAGCRC GGTTCCATTA 850 

CA 852 



20 



2) INFORMATION FOR SEQ ID NO: 1861 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 828 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



30 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus garvieae 

(B) - STRAIN: ATCC 49156 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1861 

GCGCGACTCT TCCTGAGATT AATAACGCAC TCATCGTTTA CAAAGATGTA 50 

GACGGCGTTA AAACTAAAAT CGTCCTTGAA GTGGCGTTGG AACTTGGTGA 100 

TGGTGCCGTA CGTACCATCG CTATGGAATC AACTGATGGC TTGACACGTG 150 

40 GACTTGAAGT TCTCGATACA GGTAAAGCAA TCAGCGTACC TGTTGGTCAA 200 

GAAACACTTG GACGTGTCTT CAATGTACTT GGAGATGCTA TTGATGGAGG 250 

GGAAGCATTT GCTGAAAATG CAGAACGCAG CCCTATCCAT AAAAAAGCCC 300 

CATCTTTTGA TGAACTTTCA ACAGCAAATG AAATTCTGGT GACAGGGATT 350 

AAAGTTATTG ACTTGCTTGC CCCATACCTT AAAGGTGGTA AGATTGGGTT 400 

45 GTTCGGTGGT GCCGGAGTTG GTAAAACCGT CCTTATCCAA GAGTTGATTC 450 

ACAATATTGC CCAAGAACAC GGTGGTATTT CCGTATTTAC TGGTGTTGGG 500 

GAACGTACAC GTGAAGGGAA TGACCTTTAC TGGGAAATGA AAGAATCAGG 550 

CGTTATCGAA AAAACAGCCA TGGTCTTCGG TCAAATGAAT GAACCACCTG 600 

GAGCACGTAT GCGTGTTGCT CTTACTGGTT TGACAATTGC TGAATATTTC 650 

50. CGTGATGTAG AAAAACAAGA CGTTTTGCTT TTCATTGATA ATATCTTCCG 7 00 

TTTCACCCAA GCCGGTTCAG AAGTATCTGC CCTCTTAGGA CGTATGCCAT 750 

CAGCCGTTGG TTACCAACCT ACGCTTGCAA CTGAAATGGG TCAACTTCAA 800 

GAACGTATCA CTTCAACAAA ACAAGGTT 828 
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2) INFORMATION FOR SEQ ID NO: 1862 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Lactococcus lactis 

(B) STRAIN: ATCC 11454 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1862 

AATTGCCTGA RATTAACAAT GCCTTGATTG TCTACAAAGA TGTCAATGGC 50 

CTAAAAACAA AAATTACTCT TGAAGTTGCT TTGGAACTTG GTGATGGTGC 100 

AGTTCGTACA ATCGCTATGG AATCTACTGA TGGCTTGACT CGTGGACTTG 150 

20 AAGTCCTTGA TACAGGTAAA GCAGTCAGCG TTCCTGTTGG GGAAGCCACT 200 

CTTGGTCGTG TTTTTAACGT TCTTGGTGAT GTTATTGACG GTGGGGAAGA 250 

ATTTGCTGCT GATGCAGAAC GTAATCCTAT CCATAAAAAA GCTCCAACAT 300 

TTGACGAATT GTCAACTGCA AACGAAGTTC TCGTAACTGG GATTAAAGTT 350 

GTCGATTTGC TTGCACCTTA CCTTAAAGGT GGTAAAGTTG GACTTTTCGG 4 00 

25 TGGTGCCGGA GTTGGTAAAA CCGTCCTTAT TCAAGAATTG ATTCACAACA 4 50 

TCGCCCAAGA ACACGGAGGT ATTTCTGTGT TTACCGGTGT TGGGGAACGT 500 

ACTCGTGAAG GGAATGACCT TTACTGGGAA ATGAAAGAAT CAGGCGTTAT 550 

TGAAAAAACT GCCATGGTCT TTGGTCAAAT GAATGAACCA CCAGGAGCAC 600 

GTATGCGTGT TGCCCTTACT GGTTTGACAA TTGCTGAATA TTTCCGTGAT 650 

3 0 GTTCAAGGTC AAGACGTACT GCTTTTCATT GACAACATCT TCCGTTTCAC 7 00 

ACAAGCTGGT TCAGAAGTTT CTGCCCTTTT GGGACGTATG CCTTCTGCCG 750 

TTGGTTACCA ACCAACACTT GCTACTGAAA TGGGGCAATT GCAAGAACGT 800 

ATCACTTCTA CTAAGAAGGG TTCTGTTA 828 



35 



50 



2) INFORMATION FOR SEQ ID NO: 1863 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 825 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria ivanovii 

(B) STRAIN: ATCC 19119 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18 63 

CTTACCTGAA ATCTACAACG CCCTAGTTAT TGAATATAAA TCTGATGCAG 50 
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AAGAAGCACC AACTAGCCAA CTTACTTTAG AAGTAGCCAT TCAATTAGGT 100 

GATGACGTTG TTCGTACAAT TGCAATGGCA TCAACTGATG GTGTTCAAAG 150 

AGGTATGGAA GTTATTGATA CTGGGAGCCC AATCACAGTT CCAGTTGGTA 200 

CAGTAACACT TGGTCGTGTA TTTAACGTAT TAGGAAACAC TATCGATTTG 250 

5 GATGAGCCAC TTCCAAGCGA TATCAAGCGT AATAAAATTC ACCGTGAAGC 300 

ACCAACATTT GACCAATTAG CAACAACTAC TGAAATTCTT GAAACAGGAA 350 

TAAAAGTTGT AGACTTGCTA GCCCCATATT TAAAAGGTGG TAAAATTGGT 400 

TTGTTCGGCG GAGCGGGTGT TGGTAAAACC GTTTTAATCC AAGAACTTAT 4 50 

TCATAATATC GCTCAAGAAC ATGGTGGTAT TTCTGTGTTC GCTGGTGTTG 500 

10 GAGAACGTAC TCGTGAAGGG AACGATCTTT ACTTTGAAAT GAAAGACTCT 550 

GGTGTAATTG AAAAAACTGC CATGGTATTC GGTCAAATGA ACGAACCACC 600 

AGGTGCTCGT ATGCGTGTAG CTTTAACAGG TCTTACAATC GCTGAATATT 650 

TCCGTGATGA AGAACACCAA GATGTACTTC TATTCATTGA TAATATTTTC 7 00 

CGCTTTACTC AAGCTGGTTC AGAGGTTTCG GCTTTACTAG GTCGTATGCC 750 

15 ATCTGCAGTA GGTTATCAAC CAACTCTAGC TACCGAAATG GGACAATTAC 800 

AAGAACGTAT TACTTCTACT AATGT 825 



20 2) INFORMATION FOR SEQ ID NO: 1864 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 



30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Succinivibrio dextrinosolvens 

(B) STRAIN: ATCC 19716 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1864 

35 

GTCATATTCC AGAGCTTTAT GACGCTCTGG AGGTTAAGGG CGATGGTAAG 50 

CATCGTTCAG ACCTAGTTCT TGAGGTTCAA CAGCAGATTG GCGGTGGTGT 100 

GGTACGCTGC ATTGCCATGG GTTCTTCTGA CGGTTTGAGC AGAGGAATTG 150 

AGGCTGTAAA TACTGGTGCC GGTGTTAAGG TTCCAGTTGG TCGTGAGACC 200 

40 CTAGGACGTA TTATGAACGT TTTAGGTCAG CCTGTAGATG AGAGAGGTCC 250 

TATCGGACAG AAAGAGGATT GGGAAATTCA CCGTCCAGCT CCTACCTATG 300 

CTGAGCAGTC ATCAACTACA GAAATTCTAG AAACCGGTAT TAAGGTTATG 350 

GACCTTATCT GCCCATTTGC TAAGGGTGGT AAAGTTGGTC TGTTCGGTGG 400 

TGCCGGTGTG GGTAAGACAG TTAACATGAT GGAGCTTATC AATAACATTG 450 

45 CTAAGGCTCA CTCAGGTCTA TCTGTATTTA CCGGTGTTGG TGAGCGTACT 500 

CGTGAGGGTA ACGACTTCTA CCACGAAATG CAGGAATCAA AGGTTATCGA 550 

TAAGGTATCA ATGATTTACG GTCAGATGAA CGAGCCTCCA GGGAACCGTC 600 

TACGTGTTGC TCTGACAGGT CTGACTGTTG CTGAGAAGTT CCGTGACGAA 650 

GGTCTGGATG TGCTTCTGTT CATCGATAAC ATCTATCGTT ATACACTGGC 700 

50 TGGTACAGAG GTATCTGCTC TGTTAGGCCG TATGCCTTCA GCTGTGGGTT 7 50 

ACCAGCCTAC ACTGGCTGAG GAAATGGGTG TATTACAGGA GCGTATTGCT 800 

TCAACTAAGA AAGGTTCTAT T 821 
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2) INFORMATION FOR SEQ ID NO: 1865 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 822 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tetragenococcus halophilus 

(B) STRAIN: ATCC 33315 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1865 

TGATACTTTA CCAGATATCA ATAATGCATT AGCCGTATAT AAAAATGATG 50 

AGAACAAGAC GCGTGTTGTA TTGGAAGCTA CTTTAGAACT TGGAGATGGG 100 

GTAATTCGTG CCATTTCTAT GGGGTCTACT GACGGCTTGC AACGTGGCAT 150 

20 GGAAGTTGTG GATACACAAG AACCTATTTC TGTTCCGGTA GGAAATGATA 200 

CTTTAGGTCG TGTATTTAAT GTGTTAGGAG AAACAATAGA TAATCAGGAG 250 

CCATTTCCTG AAGATGCTGA AAAAAGTGGT ATTCACAAAA AAGCCCCTAG 300 

TTTTGATGAA TTAAGTACTA GTTCGGAAAT ATTAGAAACA GGGATCAAAG 350 

TGATTGATTT ATTAGAACCT TATCTAAGAG GCGGTAAAGT CGGATTGTTT 4 00 

2 5 GGAGGCGCCG GTGTTGGAAA AACGGTGCTA ATTCAAGAAT TGATCAATAA 450 

TGTTGCCCAA GAACACGGGG GTATTTCCGT GTTTAATGGT GTAGGTGAAC 500 

GTACTCGTGA AGGTAATGAC TTGTATTATG AAATGCAGGA TTCAGGCGTT 550 

ATCGAAAAAA CAGCCATGGT GTTTGGTCAA ATGAACGAAC CACCAGGTGC 600 

TCGTATGCGT GTTGCTTTAA CTGGCCTAAC ACTGGCAGAA TATTTTCGAG 650 

30 ATGTTGAAGG TCAAGACGTA TTATTATTTA TTGATAATAT TTTCCGTTTT 700 

ACACAAGCAG GTACCGAAGT TTCCGCTTTA CTTGGTAGAA TGCCATCTGC 750 

TGTTGGCTAT CAACCCACAC TAGCAACTGA AATGGGGCAA CTGCAAGAAC 800 

GGATTACGTC AACGGATAAG GG 822 



35 



50 



2) INFORMATION FOR SEQ ID NO: 1866 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 818 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Ca/npylojbacter fetus subsp. fetus 

(B) STRAIN: ATCC 25936 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1866 

ATCTCCTCAG GATCKATAGG ACTTGATATA GCTCTTGGTA TAGGCGGCGT 50 
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ACCAAAAGGA 
CAACTCTTAC 
TGCGCGTTTG 
TTTAGGCGTT 
AGCAAGCTCT 
CTTATAGTAA 
AGGCGATATG 
AAGCACTTAG 
GTATTTATAA 
TCCTGAAACT 
GACTTGACGT 
GGAAACCGCG 
TAGACAAGCC 
GAGAGATAAT 
GCTTGGTTTA 
AAAAGCGTTT 



AGAATAGTCG 
TTTGCATTTA 
TAGATGCAGA 
GATACGGATA 
TGATATAGTA 
TAGATAGCGT 
GGAGATCAGC 
AAAATTAACC 
ACCAAATTCG 
ACTACTGGCG 
ACGTAAAATA 
TAAAAGTAAA 
GAATTTGATA 
AGATTACGGC 
GCTATGATAA 
TTAAAAGA 



AAATTTATGG 
ATAGCAGAAT 
GCATGCACTT 
ACTTATATAT 
GAAACTCTAG 
AGCAGCTYTA 
ACGTAGGGCT 
GGAGTTGTCC 
TATGAAAATC 
GAAATGCGCT 
GCTACTTTAA 
AGTAGTAAAA 
TCATGTTTGG 
GTAAAACTTG 
TTCAAAATTA 



GCCAGAAAGC 
CTCAAAAAGT 
GATGTTAAAT 
TTCTCAACCG 
CTAGAAGCGG 
ACACCAAAAA 
GCAAGCAAGA 
ATAAAATGGG 
GGCGCTATGG 
TAAATTTTAC 
AACAGAGCGA 
AACAAAGTCG 
AGAAGGTATC 
ATATTATCGA 
GGTCAAGGCA 



TCTGGTAAAA 
CGGCGGAGTT 
ATGCTAAAAA 
GACTTCGGAG 
CGCCGTTGAT 
GCGAAATAGA 
CTCATGAGTC 
AACTACAGTT 
GCTATGGCAC 
GCTTCAGTTA 
TGAGCCAATC 
CTCCTCCTTT 
AGCAAAGAAG 
TAAAAGCGGC 
GAGAAAACTC 



100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
818 



2) INFORMATION FOR SEQ ID NO: 1867 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 814 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter fetus subsp. venerealis 

(B) STRAIN: ATCC 33561 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1867 

TCTCCTCAGG ATCKATAGGA CTTGATATAG CTCTTGGTAT AGGCGGCGTA 50 

CCAAAAGGAA GAATAGTCGA AATTTATGGG CCAGAAAGCT CTGGTAAAAC 100 

AACTCTTACT TTGCATTTAA TAGCAGAATC TCAAAAAGTC GGCGGAGTTT 150 

GCGCGTTTGT AGATGCAGAG CATGCACTTG ATGTTAAATA TGCTAAAAAT 200 

TTAGGCGTTG ATACGGATAA CTTATATATT TCTCAACCGG ACTTCGGAGA 250 

GCAAGCTCTT GATATAGTAG AAACTCTAGC TAGAAGCGGC GCCGTTGATC 300 

TTATAGTAAT AGATAGCGTA GCAGCTYTAA CACCAAAAAG CGAAATAGAA 350 

GGCGATATGG GAGATCAGCA CGTAGGGCTG CAAGCAAGAC TCATGAGTCA 400 

AGCACTTAGA AAATTAACCG GAGTTGTCCA TAAAATGGGA ACTACAGTTG 450 

TATTTATAAA CCAAATTCGT ATGAAAATCG GCGCTATGGG CTATGGCACT 500 

CCTGAAACTA CTACTGGCGG AAATGCGCTT AAATTTTACG CTTCAGTTAG 550 

ACTTGACGTA CGTAAAATAG CTACTTTAAA ACAGAGCGAT GAGCCAATCG 600 

GAAACCGCGT AAAAGTAAAA GTAGTAAAAA ACAAAGTCGC TCCTCCTTTT 650 

AGACAAGCCG AATTTGATAT CATGTTTGGA GAAGGTATCA GCAAAGAAGG 700 

AGAGATAATA GATTACGGCG TAAAACTTGA TATTATCGAT AAAAGCGGCG 750 

CTTGGTTTAG CTATGATAAT TCAAAATTAG GTCAAGGCAG AGAAAACTCA 800 

AAAGCGTTTT TAAA 814 
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2) INFORMATION FOR SEQ ID NO: 1868 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 824 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni jejuni 

(B) STRAIN: ATCC 33560 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1868 

GATAGCATAG GCWCAGGTTC AGTTGGACTT GATCTTGCTT TAGGTATAGG 50 

CGGTGTTCCA AAAGGAAGAA TTATAGAAAT TTATGGGCCT GAAAGTTCAG 100 

GTAAAACTAC TCTAACTCTA CACATTATCG CAGAATGCCA AAAAGCAGGT 150 

GGGGTTTGTG CTTTTATCGA TGCAGAACAT GCACTTGATG TGAAATATGC 200 

TAAAAATTTA GGTGTAAATA CAGATGATTT GTATGTTTCT CAACCTGATT 250 

TTGGAGAGCA AGCCTTAGAA ATTGTAGAAA CTATAGCWAG AAGTGGTGCA 300 

GTAGATCTTA TWGTAGTAGA TAGCGTTGCA GCWCTTACCC CAAAAGCAGA 350 

AATTGAAGGC GATATGGGCG ATCARCATGT AGGACTTCAA GCAAGACTTA 400 

TGTCTCAAGC TCTAAGAAAA CTTACAGGTA TAGTTCATAA AATGAATACC 4 50 

ACAGTAATTT TCATCAACCA AATTCGTATG AAAATCGGTG CTATGGGTTA 500 

TGGTACTCCT GAAACCACAA CAGGTGGAAA TGCATTAAAA TTTTATGCTT 550 

CTGTGCGTTT AGATGTTAGA AAAGTAGCAA CCTTAAAMCA AAACGWAGAM 600 

CCTATAGGAA ACCGCGTTAA AGTAAAAGTA GTTAAAAATA AAGTTGCTCC 650 

TCCATTCAGM CAAGCTGAAT TTGATGTGAT GTTTGGAGAG GGTTTAAGCC 700 

GTGAAGGTGA ATTGATCGAT TATGGTGTAA AACTTGATAT CGTAGATAAA 7 50 

AGTGGTGCGT GGTTTTCTTA TAAAGATAAA AAACTTGGAC AAGGTAGAGA 800 

AAATTCAAAA GCTTTCTTAA AAGA 824 



2) INFORMATION FOR SEQ ID NO: 1869 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus avium 

(B) STRAIN: ATCC 14025 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1869 

AGAAAGTTCT GGTAAAACAA CGGTTGCACT GCATGCGATT GCAGAAGTTC 50 
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AAAAACATGG CGGGACGGCA 

CCTCAATACG CACAACGTCT 

ACAACCAGAT ACTGGGGAAC 

CAAGTGGCGC AGTCGATATT 

CCCCGTGCTG AAATCGATGG 

GGCTCGTTTG ATGTCACAAG 

AAACAAAGAC TATCGCCGTC 



GCCTTTATTG ATGCCGAGCA CGCGTTGGAC 
AGGTGTAAAC ATTGATGAAT TGCTGCTATC 
AAGGCTTAGA AATTGCAGAT GCTTTAGTTT 
ATCGTTATTG ACTCGGTGGC CGCGCTAGTC 
CGAGATGGGT GATGCGCACG TTGGTCTGCA 
CATTGCGCAA GCTGTCAGGC TCTATCAACA 
TTTATTAACC AAATTCGT 



100 
150 
200 
250 
300 
350 
388 



2) INFORMATION FOR SEQ ID NO: 1870 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: ATCC 19434 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1870 

TGAAAGTTCA GGTAAAACAA CAGTTGCACT ACACGCTATT GCAGAAGTAC 50 

AAAAAAATGG CGGAACGGCC GCTTTCATTG ATGCTGAGCA TGCGTTAGAT 100 

CCGCAATATG CACAAAAATT AGGTGTGAAT ATCGATGAAC TACTTCTTTC 150 

ACAGCCTGAC ACAGGAGAAC AAGGTCTAGA GATCGCTGAT GCTTTAGTAT 200 

CAAGTGGGGC TGTAGATATC GTAGTAGTCG ATTCAGTTGC TGCTTTAGTT 250 

CCACGAGCAG AAATCGACGG CGAAATGGGT GACTCACATG TCGGGTTACA 300 

AGCACGTTTG ATGTCTCAAG CATTGCGTAA ACTCTCTGGT TCGATCAACA 350 

AAACAAAAAC AATCGCTATT TTCATCAACC AAATCCGT 388 



2) INFORMATION FOR SEQ ID NO: 1871 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

(B) STRAIN: ATCC 15313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1871 
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AGAGAGTTCC GGTAAAACAA CTGTTGCGCT 
AAGCACAAGG CGGAACAGCA GCATTTATCG 
CCGGCTTATG CTAAAAACCT AGGTGTAAAT 
TCAACCAGAT ACAGGAGAAC AAGCTTTAGA 
GAAGTGGTGC AGTTGATATG TTAGTAATTG 
CCACGTGCTG AAATCGAAGG CGAGATGGGC 
AGCACGTTTA ATGTCCCAAG CATTGCGTAA 
AATCAAAAAC CATTGCTATT TTCATTAACC 



TCATGCAATT GCGGAAGTAC 50 

ATGCTGAGCA TGCGTTGGAT 100 

ATTGATGAAT TATTACTATC 150 

GATTGCTGAA GCTTTAGTTA 200 

ACTCCGTTGC AGCACTTGTA 250 

GATGCTCATG TTGGATTACA 300 

ACTTTCTGGT GTTATTAATA 350 

AAATTCGT 388 



2) INFORMATION FOR SEQ ID NO: 1872 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 388 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1872 



AGAGTCATCT GGTAAGACAA CGGTTGCCCT 
AAAAAGAAGG TGGTATTGCT GCCTTTATCG 
CCAGCTTATG CTGCGGCCCT TGGTGTCAAC 
ACAACCAGAC TCAGGAGAGC AAGGTCTTGA 
ACTCAGGAGC CGTGGATCTT GTCGTAGTCG 
CCTCGTGCGG AAATTGATGG AGATATCGGT 
GGCTCGTATG ATGAGCCAGG CTATGCGTAA 
AAACCAAAAC AATTGCCATC TTTATCAACC 



TCATGCAGTT GCGCAAGCAC 50 

ATGCGGAACA TGCCCTTGAT 100 

ATTGACGAAT TGCTCTTGTC 150 

GATTGCAGGA AAATTGATTG 200 

ACTCAGTTGC GGCCCTTGTC 250 

GATAGCCACG TTGGTTTGCA 300 

ACTTGGTGCT TCTATCAATA 350 

AATTGCGT 388 



2) INFORMATION FOR SEQ ID NO: 1873 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 430 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1873 
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GAACATGACG 
TTTTGGTTTT 
ATTCGAGCCT 
CGCACGAGGT 
CACCTGAGTC 
TCTGGTTGTG 
ATAGGCTGGG 
CTTCTTTCTG 
GATGATTCTG 



CCGACTTTTT 
ATTGATAGAA 
GCAAACCAAC 
ACAAGGGCCG 
AATCAATTTT 
ACAAGAGCAA 
TCAAGAGCAT 
TGCTTGCGCA 
GCGCRTACAY 



CACGCAATTG GTTGATAAAG 
GCTCCGAGTT TACGCATGGC 
GTGACTGTCT CCAATATCCC 
CAACTGAGTC GATAACGACA 
CCAGCAATTT CAAGACCTTG 
TTCGTCAATA TTCACACCAA 
GTTCCGCATC GATAAAGGCT 
ACAGCGTGAA GGGCAACCGT 
TTCGATGATA 



ATGGCAATTG 
CTGGCTCATC 
CATCAATTTC 
AGGTCAACTG 
TTCACCTGAG 
GGGCTGCAGC 
GCAATACCAC 
TGTCTTACCA 



50 
100 
150 
200 
250 
300 
350 
400 
430 



LO 



L5 



20 



25 



30 



35 



10 



15. 



2) INFORMATION FOR SEQ ID NO: 1874 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 947 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus fumigatus 

(B) STRAIN: ATCC 64746 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1874 

TTGTCGTTGT TGCTGCCTCC GACGGTCAGA TGTAGGTGGA ACATCTTGGG 50 

AAATACGTCG TAAAACACGT CGCTTACGTT TTCGCGAATA GGCCCCAGAC 100 

TCGTGAGCAT TTGCTGCTCG CCCGCCAGGT TGGTGTCCAG AAGATCGTTG 150 

TCTTCGTCAA CAAAATCGAT GCTATTGATG ATCCGGAGAT GCTGGAACTG 200 

GTCGAACTCG AGATGCGTGA GCTGCTGAAC AGCTACGGTT TCGAGGGTGA 250 

AGAGACTCCG ATCATTTTCG GTTCCGCTCT CTGTGCTCTC GAAGGACGCC 300 

GTGACGACAT CGGTAAAGAC AGAATTGAGC AGCTTATGAA CGCTGTCGAC 350 

ACCTGGATCC CCACTCCTCA GCGTGACCTC GACAAACCTT TCTTGATGTC 400 

TGTCGAGGAA GTGTTCTCTA TCGCCGGCCG TGGTACCGTG GCTTCTGGTC 4 50 

GTGTCGAGCG TGGTATCTTG AAGAAGGACT CTGAGGTTGA GATTGTTGGA 500 

GGCTCCTTCG AACCCAAGAA GACCAAAGTC ACCGACATTG AAACCTTCAA 550 

GAAGAGCTGT GATGAATCGC GTGCTGGTGA CAACTCTGGT CTCCTCCTGC 600 

GTGGTATCCG ACGTGAAGAC GTCAAGCGTG GTATGGTCAT TGCTGTTCCC 650 

GGCAGCACCA AGGCTCACGA CAAGTTCCTC GTCTCCATGT ACGTCCTGAC 700 

CGAGGCGGAG GGTGGTCGTC GTACTGGCTT CGGTGCCAAC TACCGTCCCC 750 

AAGTCTTCAT CCGTACTGCA GGTAAGTTCC CGCACACCGT GTCCAGATCT 800 

TCCGAGAGAT TAGCGATATA TGCTAATGAT TCATCAGACG AGGCTGCTGA 850 

CCTCAGCTTC CCTGACGGCG ACCAATCTCG CAGAGTTATG CCTGGTGACA 900 

ACGTCGAGAT GATCCTGAAG ACCCACCACC CTGTTGCTGC TGAGGCT 947 



2) INFORMATION FOR SEQ ID NO: 1875 



983 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 PCT/CA00/01 150 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 923 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
10 (A) ORGANISM: Aspergillus versicolor 

(B) STRAIN: WSA-175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1875 

15 GCTGCTTCCG ATGGTCAAAT GTACGTCAAC CTTATATACA CCCTCTGATA 50 

TGACAGAATG TCGCCTAACA GCACGCGTGA ACTAGGCCCC AAACCCGTGA 100 

GCACTTGCTG CTTGCCCGCC AGGTCGGTGT CCAGAAGATT GTTGTGTTCG 150 

TCAACAAGGT TGATGCCGTC GATGACCCTG AGATGTTGGA ACTTGTTGAG 200 

CTGGAAATGC GTGAGCTTCT CAGCACTTAC GGCTTCGAAG GCGAGGAGAC 250 

20 CCCTATCATC TTCGGTTCCG CCCTATGCGC CCTCGAGGGT CGCCGCCCCG 300 

ATATCGGTAC TGAGCGAATT GACAGCCTTC TTGAGGCCGT TGACACCTGG 350 

ATCCCTACCC CTCAGCGTGA CCTGGACAAG CCTTTCCTGA TGTCTGTCGA 400 

GGAGGTCTTC TCCATTGCCG GTCGTGGTAC CGTTGCCTCT GGCCGTGTTG 450 

AGCGTGGTCT CCTTAAGAAG GACAGCGAGG TCGAGATTCT CGGAGGTGGT 500 

25 CAGGTCATGA AGACCAAGGT CACTGACATT GAGACATTCA AGAAGCACTG 550 

TGACGAATCC CGTGCTGGTG ACAACTCCGG TCTTCTTCTC CGTGGTATCC 600 

GCCGTGAGGA TGTCAAGCGC GGTATGGTTA TTGCTGCTCC CGCCTCTATC 650 

AAGGCCCACA AGAAGTTCAT GGTCTCCATG TACGTCCTCA CTGAGGCAGA 700 

AGGTGGCCGT CGCAGTGGCT TCGGTGTCAA CTATCGTCCC CAGGCTTACA 750 

30 TTCGCACTGC TGGTAAGTTT TCAAAACATT CGACCTCTCG CCTTAGAAGA 800 

AGAATACCTC TAACTTGTAT TTAGACGAGG CTTGCGACCT TTCTTTCCCC 850 

GATGGCGACA TGAGCCGCCG TGTCATGCCT GGTGACAACG TGGAAATGAT 900 

CCTCAACCTC AACAACCCTG TCG 923 



35 



50 



2) INFORMATION FOR SEQ ID NO: 1876 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 807 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Basidiobolus ranarum 

(B) STRAIN: ATCC 24670 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1876 

CATCATTGTT GTGTCCGCCA CTGATGGTCA AATGCCTCAA ACTCGTGAGC 50 
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ATTTGTTGTT GGCTCGCCAA GTTGGTGTCC AACACTTGGT TGTCTTCATC 100 

AACAAGGTTG ATGCCGTTGA TGATCCTGAA ATGTTGGAGT TGGTCGAGAT 150 

GGAAATGCGT GATTTGCTTT CCCAATACGG TTTCCCCGGA GACAACGTCC 200 

CCATTATCCA GGGTTCCGCT CTCTGCGCTC TTGAGGACCG CAACCCCGAG 250 

5 ATCGGCCGTA ACGCCATCAT GAAGTTGATG GAGGCTGTTG ATAGCAGCAT 300 

CCCCACCCCT GCCCGTGATT TGGACAAACC TTTCCTCATG CCCGTTGAAG 350 

ATGTGTTCTC CATCTCTGGC CGTGGTACTG TTGCCACTGG ACGTGTTGAG 400 

CGTGGTATGG TCACCAAGGG TACTGAAGTT GAAATCGTCG GTATGGGCGA 450 

GCACTTCAAG ACCACCTTGA CCGGTATTGA AATGTTCCAC AAGGAATTGG 500 

LO ACAAGGGTAT GGCTGGTGAC AACATGGGTT GCTTGCTTCG TGGTGTCAAG 550 

CGTGAGCAAG TCCGTCGTGG TATGGTTATC TGTGCCCCCG GATCCGTCAA 600 

GCCACATAAG AAGTTCATGG CTCAGCTCTA CATTCTCACC AAGGATGAGG 650 

GAGGCCGCCA CACTCCCTTC GTCAACAACT ACCGCCCACA AATGTTCTTC 700 

AGAACTGTTG ATGTTACCGC CATCCTTAAG CACCCCCCTG GTACCCCCGA 750 

L5 TGCTGATGAG AAGATGGTCA TGCCCGGAGA CAACGTTCAA CTCGAGTGCG 800 

AGCTCTT 807 



10 2) INFORMATION FOR SEQ ID NO: 1877 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 
15 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

JO (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter gracilis 

(B) STRAIN: ATCC 33236 



J5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1877 



AGTTGTTTCT GCTGCGGATG GTCCTATGCC TCAAACTCGC GAGCATATCT 50 

TGCTTTCTCG TCAAGTAGGC GTTCCATACA TCGTAGTTTT CCTAAACAAA - 100 

ACCGATATGG TCGATGATCC GGATCTTTTA GAGTTAGTTG AAGAGGAAGT 150 

TAGAGATCTT TTAAAAGAGT ATAAATTCCC TGGCGACGAA ACCCCAATCA 200 

[0 TTAAGGGTTC TGCTCTTAAG GCTCTTGAGG AAGCTAAGGC CGGACAAGAC 250 

GGCGAATGGT CTGCAAAGAT TATGGAGCTT ATGGACGCGG TTGATAGCTA 300 

TATTCCAACT CCTGTTCGCG ATACTGATAA AGATTTCCTT CTTCCGATCG 350 

AAGATATTTT CTCGATTTCC GGTCGCGGTA CCGTTGTAAC CGGTAGAATC 4 00 

GAAAAAGGTA TCGTTAAAGT TGGTGATACT ATCGAGATCG TAGGTATTAA 450 

[5 ACCTACTCAG ACTACTACCG TCACTGGCGT TGAGATGTTT . AGAAAAGAGA 500 

TGGATCAAGG TGAAGCCGGC GATAATGTAG GTGTTTTATT GCGCGGTACT 550 

AAGAAAGAGG AAGTAGAGCG CGGTATGGTT TTATGCAAAC CAAAATCGAT 600 

CACTCCTCAT ACTAAATTTG AGGGCGAGGT TTATATCCTA ACTAAAGAAG 650 

AAGGCGGACG CCATACTCCA TTCTTTAATA ATTATAGACC GCAGTTTTAC 700 

50 GTTCGTACGA CAGATGTTAC CGGTTCGATT ACTCTTCCTG AAGGAACCGA 750 

GATGGTTATG CCGGGCGACA ACGTTAAAAT CACCGTTGAG CTAATCGCTC 800 

CGATCG 806 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni jejuni 

(B) STRAIN: ATCC 33292 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1878 

GCTGCAGATG GCCC'TATGCC ACAAACTAGA GAGCACATTC TTCTTTCTCG 50 

TCAAGTAGGC GTTCCATATA TTGTTGTTTT TATGAATAAA GCAGATATGG 100 

TTGATGATGC TGAACTTTTA GAGTTAGTTG AAATGGAAAT TAGAGAATTA 150 

TTAAGCTCTT ATGATTTCCC AGGCGATGAT ACACCTATTA TTTCTGGTTC 200 

TGCTTTAAAA GCTCTTGAAG AAGCTAAAGC TGGACAAGAT GGTGAATGGT 250 

CAGCAAAAAT TATGGATCTT ATGGCTGCAG TTGATAGCTA TATTCCAACT 300 

CCAACTCGTG ATACTGAAAA AGACTTCTTG ATGCCAATTG AAGATGTTTT 350 

CTCAATTTCA GGTCGTGGTA CTGTTGTTAC AGGTAGAATT GAAAAAGGTG 400 

TTGTAAAAGT AGGTGATACT ATCGAAATCG TTGGTATTAA AGATACTCAA 450 

ACAACAACTG TAACAGGTGT TGAAATGTTC AGAAAAGAAA TGGATCAAGG 500 

CGAAGCAGGA GATAACGTAG GTGTTCTTCT TCGTGGTACT AAAAAAGAAG 550 

AAGTTATCCG TGGTATGGTT CTTGCTAAAC CAAAATCAAT TACTCCACAC 600 

ACTGACTTCG AAGCTGAAGT TTATATCTTA AATAAAGATG AAGGTGGTAG 650 

ACATACTCCA TTCTTTAACA ACTATAGACC ACAGTTTTAT GTAAGAACAA 700 

CTGATGTTAC AGGTTCGATT AAATTAGCTG ATGGTGTTGA AATGGTTATG 750 

CCAGGTGAAA ATGTGAGAAT TACTGTAAGC TTGATCGCTC CAGTAGCACT 800 

TGAAGA 806 



2) INFORMATION FOR SEQ ID NO: 1879 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 896 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Coccidioides immitis 

(B) STRAIN: WSA-222 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1879 

ATGTATGCAA CCGAGAGCAC TCCCGGATCT TGGTTTAAAT GGCACTAATA 50 
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TAAGACAGGC CTCAAACTCG AGAGCATTTA 
TATCCAAAAA ATCGTCGTCT TCGTGAACAA 
AAGAGATGTT GGAGCTTGTT GAATTGGAGA 
TACGGTTTCS AGGGTGAAGA AACTCCCATC 
5 TGCCCTCSAA GGAAGACAAC CCGAGATCGG 
TCTTGCAGGC CGTCGACACC TGGATTCCCA 
AAGCCCTTCT TGATGTCCAT TGAGGAAGTG 
AACCGTTGTC TCCGGCCGTG TGGAGCGTGG 
AAGTTGAAAT TGTCGGCGGT TCGCCCGAGC 

10 GATATCGAAA CCTTTAAGAA GTCTTGCGAC 
CTCCGGCTTG CTCCTACGAG GCGTTAAGCG 
TGGTCGTCGC TGTACCAGGA AGTGTCAAGG 
TCGCTTTACG TCCTCACCGA AGCTGAGGGT 
CAGCAAGTAC CGCCCACAGA TGTTCATTCG 

L5 GTGATAATTT CGTTGACATG GTACTGATTG 
TCAGCTCAGC TGGCCCGGAG AAGATCAAGA 
ACAATATCGA AATGATTTGC ACCACCTTGC 
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CI ILICCjCOC 


GTCAGATCGG 


lUU 


GGTTGATGCC 


7\ m /** /"» TV /"■* TV TV 

ATCGAGGACA 




TGCGTGAACT 


CCTAACCAGC 


o n a 


ATTTTTGGCT 


CTGCTCTCTG 


oca 


TGTTACCAAG 


ATTGATGAGC 


300 


CTCCTCAGCG 


TGAGACTGAC 


"5 C /"V 

350 


TTCTCTATTT 


CCGGACGAGG 


400 


TATCCTCAAG 


AAGGACTCCG 


450 


CAATCAAAAC 


CAAGGTTACC 


c f\ r\ 

500 


GAGTCTCGCG 


rn /*»/■* m /■*» tv rn ti tv 

CTGGTGATAA 


CCA 


TGAAGATATT 


AGCCGTGGCA 


600 


CCCATACTGA 


ATTCTTAGTT 


650 


GGGCGCAAAT 


CTGGATTCAG 


700 


CACTGCCGGT 


ATGTAATACT 


750 


AATTCTATAG 


ACGAAGCGGC 


800 


CAAGATGGCT 


ATGCCAGGAG 


850 


ACCCAGTTGC 


CGCCGA 


896 



20 

2) INFORMATION FOR SEQ ID NO: 1880 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 798 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

10 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Erwinia amylovora 

(B) STRAIN: ATCC 14976 



35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1880 

CTGGTAGTTG CTGCGACTGA CGGCCCAATG CCTCAGACYC GTGAGCACAT 50 

CCTGCTGGGT CGCCAGGTTG GCGTGCCATA CATCATCGTG TTCCTGAACA 100 

AATGTGACAT GGTTGATGAT GAAGAGCTGC TGGAGCTGGT TGAAATGGAA 150 

10 GTMCGTGACC TGCTGTCACA GTACGACTTC CCAGGCGACG ACACGCCAAT 200 

CGTGCRYGGT TCTGCGCTGA AAGCGCTGGA RGGCGAAGCA GAGTGGGAAG 250 

CGAAGATCAT CGAACTGGCT GGCCATCTGG ATAACTACAT CCCGGAACCA 300 

GAGCGTGCGA TTGACAAACC GTTCCTGCTG CCAATTGAAG ACGTGTTCTC 350 

CATCTCTGGC CGTGGTACCG TTGTTACCGG TCGTGTAGAG CGCGGTRTSG 400 

15 TTAAAGTGGG TGAAGAAGTT GAAATCGTTG GTATCAAAGA TACCGTGAAA 4 50 

TCAACCTGTA CCGGCGTTGA GATGTTCCGT AAGCTGCTGG ACGAAGGCCG 500 

TGCGGGTGAG AACTGTGGTA TCCTGCTGCG CGGTATCAAG CGCGAAGATA 550 

TCCAGCGTGG TCAGGTTCTG GCGAAGCCAG GCACCATCAA GCCACACACC 600 

AAGTTCGAGT CAGAAGTTTA TATTCTGTCT AAAGACGAAG GCGGCCGTCA 650 

50 TACTCCGTTC TTCAAAGGCT ACCGTCCACA GTTCTACTTC CGTACTACTG 700 

ACGTGACCGG GACTATCGAA CTGCCAGAAG GCGTTGAGAT GGTGATGCCA 750 

GGCGACAACA TTCAGATGGT TGTGACCCTG ATCCACCCGA TCGCCATG 798 
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2) INFORMATION FOR SEQ ID NO: 1881 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Typhimurium 

(B) STRAIN: ATCC 14028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1881 



ATCCTGGTTG TTGCTGCGAC TGACGGYCCG ATGCCGCAGA CCCGTGAGCA 50 

CATCCTGCTG GGTCGTCAGG TAGGCGTTCC GTACATCATC GTGTTCCTGA 100 

2 0 ACAAATGCGA CATGGTTGAT GACGAAGAGC TGCTGGAACT GGTTGAAATG 150 

GAAGTTCGYG AACTGCTGTC TCAGTACGAC TTCCCGGGCG ACGACACTCC 200 

GATCGTTCGT GGTTCTGCTC TGAAAGCGCT GGAAGGCGAC GCAGAGTGGG 250 

AAGCGAAAAT CATCGAACTG GCTGGCTTCC TGGATTCTTA CATYCCGGAA 300 

CCAGAGCGTG CGATTGACAA GCCGTTCCTG CTGCCGATCG AAGACGTATT 350 

2 5 CTCCATCTCC GGTCGTGGTA CCGTTGTTAC CGGTCGTGTA GARCGCGGTA 400 

TCATCAAAGT GGGCGAAGAA GTTGAAATCG TTGGTATCAA AGAGACTCAG 450 

AAGTCTACCT GTACTGGCGT TGAAATGTTC CGCAAACTGC TGGACGAAGG 500 

CCGTGCCGGT GAGAACGTAG GTGTTCTGCT GCGTGGTATC AAACGTGAAG 550 

AAATCGAACG TGGTCAGGTA CTGGCTAAGC CGGGCACCAT CAAGCCGCAC 600 

3 0 ACCAAGTTCG AATCTGAAGT GTACATTCTG TCCAAAGATG AAGGCGGCCG 650 

TCATACTCCG TTCTTCAAAG GCTACCGTCC GCAGTTCTAC TTCCGTACTA 700 

CTGACGTGAC TGGCACCATC GAACTGCCGG AAGGCGTAGA GATGGTAATG 7 50 

CCGGGCGACA ACATCAAAAT GGTTGTTACC CTGATCCACC CGATCGCGAT 800 

GGACGACGGT 810 

35 



2) INFORMATION FOR SEQ ID NO: 1882 

4 0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 888 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus cohnii 
50 (B) STRAIN: BM10711 

(C) ACCESSION NUMBER : AF015628 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1882 
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ATGAATTTTT ATTTAGAGGA GTTTAACTTG TCTATTCCCG ATTCAGGTCC 50 

ATACGGTATA ACTTCATCAG AAGACGGAAA GGTATGGTTC ACACAACATA 100 

AGGCAAACAA AATCAGCAGT CTAGATCAGA GTGGTAGGAT AAAAGAATTC 150 

5 GAAGTTCCTA CCCCTGATGC TAAAGTGATG TGTTTAATTG TATCTTCACT 200 

TGGAGACATA TGGTTTACAG AGAATGGTGC AAATAAAATC GGAAAGCTCT 250 

CAAAAAAAGG TGGCTTTACA GAATATCCAT TGCCACAGCC GGATTCTGGT 300 

CCTTACGGAA TAACGGAAGG TCTAAATGGC GATATATGGT TTACCCAATT 350 

GAATGGAGAT CGTATAGGAA AGTTGACAGC TGATGGGACT ATTTATGAAT 400 

10 ATGATTTGCC AAATAAGGGA TCTTATCCTG CTTTTATTAC TTTAGGTTCG 450 

GATAACGCAC TTTGGTTCAC GGAGAACCAA AATAATTCTA TTGGAAGGAT 500 

TACAAATACA GGGAAATTAG AAGAATATCC TCTACCAACA AATGCAGCGG 550 

CTCCAGTGGG TATCACTAGT GGTAACGATG GTGCACTCTG GTTTGTCGAA 600 

ATTATGGGCA ACAAAATAGG TCGAATCACT ACAACTGGTG AGATTAGCGA 650 

15 ATATGATATT CCAACTCCAA ACGCACGTCC ACACGCTATA ACCGCGGGGA 700 

AAAATAGCGA AATATGGTTT ACTGAATGGG GGGCAAATCA AATCGGCAGA 750 

ATTACAAACG ACAAAACAAT TCAAGAATAT CAACTTCAAA CAGAAAATGC 800 

GGAACCTCAT GGTATTACCT TTGGAAAAGA TGGATCCGTA TGGTTTGCAT 850 

TAAAATGTAA AATTGGGAAG CTGAATTTGA ACGAATGA 888 

20 



2) INFORMATION FOR SEQ ID NO: 1883 

2 5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1883 
35 AGCCGCTTGA GCAAATTAAA CTA 23 



40 



2) INFORMATION FOR SEQ ID NO: 1884 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1884 

50 

GTATCCCGCA GATAAATCAC CAC 23 
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2) INFORMATION FOR SEQ ID NO: 1885 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1885 
AGCGAAAAAC ACCTTGCCGA C 



2) INFORMATION FOR SEQ ID NO: 1886 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1886 
GACGCCCGCG CCACCACT 



2) INFORMATION FOR SEQ ID NO: 1887 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1887 
GACGCCCGCG ACACCACTA 



2) INFORMATION FOR SEQ ID NO: 1888 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1888 
GACGCCCGCA ACACCACTA 



2) INFORMATION FOR SEQ ID NO: 1889 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1889 
GTTCGCAACT GCAGCTGCTG 



2) INFORMATION FOR SEQ ID NO: 1890 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1890 
TTCGCAACGG CAGCTGCTG 



2) INFORMATION FOR SEQ ID NO: 1891 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1891 
CCGGAGCTGC CGAICGGG 
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2) INFORMATION FOR SEQ ID NO: 1892 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1892 
CGGAGCTGCC AARCGGGG 



2) INFORMATION FOR SEQ ID NO: 1893 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1893 
GGAGCTGGCG ARCGGGGT 



2) INFORMATION FOR SEQ ID NO: 1894 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1894 
GACCGGAGCT AGCGARCG 
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2) INFORMATION FOR SEQ ID NO: 1895 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1895 
CGGAGCTAGC AARCGGGGT • 19 

15 

2) INFORMATION FOR SEQ ID NO: 1896 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1896 
30 GAAACGGAAC TGAATGAGGC G 21 



35 



2) INFORMATION FOR SEQ ID NO: 1897 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
10 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1897 

45 

CATTACCATG GGCGATAACA G 21 



50 2) INFORMATION FOR SEQ ID NO: 1898 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
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(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1898 
CCATTACCAT GAGCGATAAC AG 



2) INFORMATION FOR SEQ ID NO: 1899 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: 15571 

(C) ACCESSION NUMBER: AF124984 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1899 

ATGCGTTATA TTCGCCTGTG TATTATCTCC CTGTTAGCCA CCCTGCCGCT 50 

GGCGGTACAC GCCAGCCCGC AGCCGCTTGA GCAAATTAAA CTAAGCGAAA 100 

GCCAGCTGTC GGGCCGCGTA GGCATGATAG AAATGGATCT GGCCAGCGGC 150 

CGCACGCTGA CCGCCTGGCG CGCCGATGAA CGCTTTCCCA TGATGAGCAC 200 

CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT GGCGCGGGTG GATGCCGGTG 250 

ACGAACAGCT GGAGCGAAAG ATCCACTATC GCCAGCAGGA TCTGGTGGAC 300 

TACTCGCCGG TCAGCGAAAA ACACCTTGCC GACGGCATGA CGGTCGGCGA 350 

ACTCTGCGCC GCCGCCATTA CCATGAGCGA TAACAGCGCC GCCAATCTGC 400 

TACTGGCCAC CGTCGGCGGC CCCGCAGGAT TGACTGCCTT TTTGCGCCAG 450 

ATCGGCGACA ACGTCACCCG CCTTGACCGC TGGGAAACGG AACTGAATGA 500 

GGCGCTTCCC GGCGACGCCC GCGACACCAC TACCCCGGCC AGCATGGCCG 550 

CGACCCTGCG CAAGCTGCTG ACCAGCCAGC GTCTGAGCGC CCGTTCGCAA 600 

CGGCAGCTGC TGCAGTGGAT GGTGGACGAT CGGGTCGCCG GACCGTTGAT 650 

CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT CGCCGATAAG ACCGGAGCTG 700 

GCGAGCGGGG TGCGCGCGGG ATTGTCGCCC TGCTTGGCCC GAATAACAAA 7 50 

GCAGAGCGCA TTGTGGTGAT TTATCTGCGG GATACCCCGG CGAGCATGGC 800 

CGAGCGAAAT CAGCAAATCG CCGGGATCGG CGCGGCGCTG ATCGAGCACT 8 50 

GGCAACGCTA A 861 



2) INFORMATION FOR SEQ ID NO: 1900 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 780 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: SLK-47 

(C) ACCESSION NUMBER: Y11069 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1900 

CTGTTAGCCA CCCTGCCGCT GGCGGTACAC GCCAGCCCGC AGCCGCTTGA 50 

GCAAATTAAA CTAAGCGAAA GCCAGCTGTC GGGCCGCGTA GGCATGATAG 100 

AAATGGATCT GGCCAGCGGC CGCACGCTGA CCGCCTGGCG CGCCGATGAA 150 

CGCTTTCCCA TGATGAGCAC CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT 200 

GGCGCGGGTG GATGCCGGTG ACGAACAGCT GGAGCGAAAG ATCCACTATC 250 

GCCAGCAGGA TCTGGTGGAC TACTCGCCGG TCAGCGAAAA ACACCTTGCC 300 

GACGGCATGA CGGTCGGCGA ACTCTGCGCC GCCGCCATTA CCATGAGCGA 350 

TAACAGCGCC GCCAATCTGC TACTGGCCAC CGTCGGCGGC CCCGCAGGAT 4 00 

TGACTGCCTT TTTGCGCCAG ATCGGCGACA ACGTCACCCG CCTTGACCGC 4 50 

TGGGAAACGG AACTGAATGA GGCGCTTCCC GGCGACGCCC GCGCCACCAC 500 

TACCCCGGCC AGCATGGCCG CGACCCTGCG CAAGCTGCTG ACCAGCCAGC 550 

GTCTGAGCGC CCGTTCGCAA CGGCAGCTGC TGCAGTGGAT GGTGGACGAT 600 

CGGGTCGCCG GACCGTTGAT CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT 650 

CGCCGATAAG ACCGGAGCTG GCGAGCGGGG TGCGCGCGGG ATTGTCGCCC 700 

TGCTTGGCCC GAATAACAAA GCAGAGCGCA TTGTGGTGAT TTATCTGCGG 750 

GATACCCCGG CGAGCATGGC CGAGCGAAAT 780 



2) INFORMATION FOR SEQ ID NO: 1901 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: U92041 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1901 

ATGCGTTATA TTCGCCTGTG TATTATCTCC CTGTTAGCCA CCCTGCCGCT 50 

GGCGGTACAC GCCAGCCCGC AGCCGCTTGA GCAAATTAAA CTAAGCGAAA 100 

GCCAGCTGTC GGGCCGCGTA GGCATGATAG AAATGGATCT GGCCAGCGGC 150 

CGCACGCTGA CCGCCTGGCG CGCCGATGAA CGCTTTCCCA TGATGAGCAC 200 
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CTTTAAAGTA 
ACGAACAGCT 
TACTCGCCGG 
ACTCTGCGCC 
TACTGGCCAC 
ATCGGCGACA 
GGCGCTTCCC 
CGACCCTGCG 
CGGCAGCTGC 
CCGCTCCGTG 
GCGAGCGGGG 
GCAGAGCGCA 
CGAGCGAAAT 
GGCAACGCTA 



GTGCTCTGCG 
GGAGCGAAAG 
TCAGCGAAAA 
GCCGCCATTA 
CGTCGGCGGC 
ACGTCACCCG 
GGCGACGCCC 
CAAGCTGCTG 
TGCAGTGGAT 
CTGCCGGCGG 
TGCGCGCGGG 
TTGTGGTGAT 
CAGCAAATCG 
A 



GCGCAGTGCT 
ATCCACTATC 
ACACCTTGCC 
CCATGAGCGA 
CCCGCAGGAT 
CCTTGACCGC 
GCAACACCAC 
ACCAGCCAGC 
GGTGGACGAT 
GCTGGTTTAT 
ATTGTCGCCC 
TTATCTGCGG 
CCGGGATCGG 



GGCGCGGGTG 
GCCAGCAGGA 
GACGGCATGA 
TAACAGCGCC 
TGACTGCCTT 
TGGGAAACGG 
TACCCCGGCC 
GTCTGAGCGC 
CGGGTCGCCG 
CGCCGATAAG 
TGCTTGGCCC 
GATACCCCGG 
CGCGGCGCTG 



GATGCCGGTG 
TCTGGTGGAC 
CGGTCGGCGA 
GCCAATCTGC 
TTTGCGCCAG 
AACTGAATGA 
AGCATGGCCG 
CCGTTCGCAA 
GACCGTTGAT 
ACCGGAGCTG 
GAATAACAAA 
CGAGCATGGC 
ATCGAGCACT 



250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
861 



2) INFORMATION FOR SEQ ID NO: 1902 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: 803 

(C) ACCESSION NUMBER: AF164577 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1902 

ATGCGTTATA TTCGCCTGTG TATTATCTCC CTGTTAGCCA CCCTGCCGCT 50 

GGCGGTACAC GCCAGCCCGC AGCCGCTTGA GCAAATTAAA CAAAGCGAAA 100 

GCCAGCTGTC GGGCCGCGTA GGCATGATAG AAATGGATCT GGCCAGCGGC 150 

CGCACGCTGA CCGCCTGGCG "CGCCGATGAA CGCTTTCCCA TGATGAGCAC 200 

CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT GGCGCGGGTG GATGCCGGTG 250 

ACGAACAGCT GGAGCGAAAG ATCCACTATC GCCAGCAGGA TCTGGTGGAC 300 

TACTCGCCGG TCAGCGAAAA ACACCTTGCC GACGGCATGA CGGTCGGCGA 350 

ACTCTGCGCC GCCGCCATTA CCATGAGCGA TAACAGCGCC GCCAATCTGC 400 

TGCTGGCCAC CGTCGGCGGC CCCGCAGGAT TGACTGCCTT TTTGCGCCAG 4 50 

ATCGGCGACA ACGTCACCCG CCTTGACCGC TGGGAAACGG AACTGAATGA 500 

GGCGCTTCCC GGCGACGCCC GCGACACCAC TACCCCGGCC AGCATGGCCG 550 

CGACCCTGCG CAAGCTGCTG ACCAGCCAGC GTCTGAGCGC CCGTTCGCAA 600 

CGGCAGCTGC TGCAGTGGAT GGTGGACGAT CGGGTCGCCG GACCGTTGAT 650 

CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT CGCCGATAAG ACCGGAGCTG 700 

CCGAGCGGGG TGCGCGCGGG ATTGTCGCCC TGCTTGGCCC GAATAACAAA 750 

GCAGAGCGCA TTGTGGTGAT TTATCTGCGG GATACGCCGG CGAGCATGGC 800 

CGAGCGAAAT CAGCAAATCG CCGGGATCGG CGCGGCGCTG ATCGAGCACT 850 

GGCAACGCTA A 861 
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2) INFORMATION FOR SEQ ID NO: 1903 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 896 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: ATCC 700603 

(C) ACCESSION NUMBER: AF132290 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1903 

ATGCGTTATT TTCGCCTGTG TATTATCTCC CTGTTAGCCA CCCTGCCGCT 50 

GGCGGTACAC GCCAGCCCGC AGCCGCTTGA GCAAATTAAA CTAAGCGAAA 100 

GCCAGCTGTC GGGCAGCGTA GGCATGATAG AAATGGATCT GGCCAGCGGC 150 

CGCACGCTGA CCGCCTGGCG CGCCGATGAA CGCTTTCCCA TGATGAGCAC 200 

CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT GGCGCGGGTG GATGCCGGTG 250 

ACGAACAGCT GGAGCGAAAG ATCCACTATC GCCAGCAGGA TCTGGTGGAC 300 

TACTCGCCGG TCAGCGAAAA ACACCTTGCC GACGGCATGA CGGTCGGCGA 350 

ACTCTGTGCC GCCGCCATTA CCATGAGCGA TAACAGCGCC GCCAATCTGC 400 

TGCTGGCCAC CGTCGGCGGC CCCGCAGGAT TGACTGCCTT TTTGCGCCAG 450 

ATCGGCGACA ACGTCACCCG CCTTGACCGC TGGGAAACGG AACTGAATGA 500 

GGCGCTTCCC GGCGACGCCC GCGACACCAC TACCCCGGCC AGCATGGCCG 550 

CGACCCTGCG CAAGCTGCTG ACCAGCCAGC GTCTGAGCGC CCGTTCGCAA 600 

CGGCAGCTGC TGCAGTGGAT GGTGGACGAT CGGGTCGCCG GACCGTTGAT 650 

CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT CGCCGATAAG ACCGGAGCTG 700 

CCAAACGGGG TGCGCGCGGG ATTGTCGCCC TGCTTGGCCC GAATAACAAA 750 

GCAGAGCGGA TTGTGGTGAT TTATCTGCGG GATACGCCGG CGAGCATGGC 800 

CGAGCGAAAT CAGCAAATCG CCGGGATCGG CGCGGCGCTG ATCGAGCACT 850 

GGCAACGCTA ACCCGGCGGT GGCCGCGCGC GTTATCCGGC TCGTAG 896 



2) INFORMATION FOR SEQ ID NO: 1904 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 61 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: JC2926 

(C) ACCESSION NUMBER: AF148851 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1904 

ATGCGTTATA TTCGCCTGTG TATTATCTCC CTGTTAGCCA CCCTGCCGCT 50 

GGCGGTACAC GCCAGCCCGC AGCCGCTTGA GCAAATTAAA CTAAGCGAAA 100 

5 GCCAGCTGTC GGGCCGCGTA GGCATGATAG AAATGGATCT GGCCAGCGGC 150 

CGCACGCTGA CCGCCTGGCG CGCCGATGAA CGCTTTCCCA TGATGAGCAC 200 

CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT GGCGCGGGTG GATGCCGGTG 250 

ACGAACAGCT GGAGCGAAAG ATCCACTATC GCCAGCAGGA TCTGGTGGAC 300 

TACTCGCCGG TCAGCGAAAA ACACCTTGCC GACGGCATGA CGGTCGGCGA 350 

10 ACTCTGCGCC GCCGCCATTA CCATGAGCGA TAACAGCGCC GCCAATCTGC 400 

TACTGGCCAC CGTCGGCGGC CCCGCAGGAT TGACTGCCTT TTTGCGCCAG 450 

ATCGGCGACA ACGTCACCCG CCTTGACCGC TGGGAAACGG AACTGAATGA 500 

GGCGCTTCCC GGCGACGCCC GCGACACCAC TACCCCGGCC AGCATGGCCG 550 

CGACCCTGCG CAAGCTGCTG ACCAGCCAGC GTCTGAGCGC CCGTTCGCAA 600 

15 CGGCAGCTGC TGCAGTGGAT GGTGGACGAT CGGGTCGCCG GACCGTTGAT 650 

CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT CGCCGATAAG ACCGGAGCTA 700 

GCGAGCGGGG TGCGCGCGGG ATTGTCGCCC TGCTTGGCCC GAATAACAAA 750 

GCAGAGCGCA TTGTGGTGAT TTATCTGCGG GATACCCCGG CGAGCATGG'C 800 

CGAGCGAAAT CAGCAAATCG CCGGGATCGG CGCGGCGCTG ATCGAGCACT 850 

20 GGCAACGCTA A 8 61 



2) INFORMATION FOR SEQ ID NO: 1905 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 61 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
30 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: Pseudomonas aeruginosa 

(C) ACCESSION NUMBER: AF096930 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1905 

40 ATGCGTTATA TTCGCCTGTG TATTATCTCC CTGTTAGCCA CCCTGCCGCT 50 

GGCGGTACAC GCCAGCCCGC AGCCGCTTGA GCAAATTAAA CTAAGCGAAA 100 

GCCAGCTGTC GGGCCGCGTA GGCATGATAG AAATGGATCT GGCCAGCGGC 150 

CGCACGCTGA CCGCCTGGCG CGCCGATGAA CGCTTTCCCA TGATGAGCAC 200 

CTTTAAAGTA GTGCTCTGCG GCGCAGTGCT GGCGCGGGTG GATGCCGGTG 250 

45 ACGAACAGCT GGAGCGAAAG ATCCACTATC GCCAGCAGGA TCTGGTGGAC 300 

TACTCGCCGG TCAGCGAAAA ACACCTTGCC GACGGCATGA CGGTCGGCGA 350 

ACTCTGCGCC GCCGCCATTA CCATGAGCGA TAACAGCGCC GCCAATCTAC 400 

TACTGGCCAC CGTCGGCGGC CCCGCAGGAT TGACTGCCTT TTTGCGCCAG 4 50 

ATCGGCGACA ACGTCACCCG CCTTGACCGC TGGGAAACGG AACTGAATGA 500 

50 GGCGCTTCCC GGCGACGCCC GCGACACCAC TACCCCGGCC AGCATGGCCG 550 

CGACCCTGCG CAAGCTGCTG ACCAGCCAGC GTCTGAGCGC CCGTTCGCAA 600 

CGGCAGCTGC TGCAGTGGAT GGTGGACGAT CGGGTCGCCG GACCGTTGAT 650 

CCGCTCCGTG CTGCCGGCGG GCTGGTTTAT CGCCGATAAG ACCGGAGCTA 700 
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GCAAACGGGG TGCGCGCGGG ATTGTCGCCC TGCTTGGCCC GAATAACAAA 

GCAGAGCGCA TTGTGGTGAT TTATCTGCGG GATACGCCGG CGAGCATGGC 

GGAGCGAAAT CAGCAAATCG CCGGGATCGG CGCGGCGCTG ATCGAGCACT 
GGCAACGCTA A 



750 
800 
850 
861 



2) INFORMATION FOR SEQ ID NO: 1906 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1906 
CCTTATTCCC TTTTTTGCGG 20 



2) INFORMATION FOR SEQ ID NO: 1907 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1907 
CACCTATCTC AGCGATCTGT CT 22 



2) INFORMATION FOR SEQ ID NO: 1908 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1908 
AACAGCGGTA. AGATCCTTGA GAG 23 
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10 



15 



2) INFORMATION FOR SEQ ID NO: 1909 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1909 
ATGACTTGGT TAAGTACTCA CC 22 



2) INFORMATION FOR SEQ ID NO: 1910 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1910 
ATGACTTGGT TGAGTACTCA CC • 22 

30 



2) INFORMATION FOR SEQ ID NO: 1911 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1911 
45 CCATAACCAT GGGTGATAAC AC 22 



2) INFORMATION FOR SEQ ID NO: 1912 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1912 
CCATAACCAT GAGTGATAAC AC 



2) INFORMATION FOR SEQ ID NO: 1913 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1913 
CGCCTTGATC ATTGGGAACC 



2) INFORMATION FOR SEQ ID NO: 1914 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1914 
CGCCTTGATC GTTGGGAACC 



2) INFORMATION FOR SEQ ID NO: 1915 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1915 
CGCCTTGATA GTTGGGAACC 
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2) INFORMATION FOR SEQ ID NO: 1916 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1916 
CGTGGGTCTT GCGGTATCAT 



2) INFORMATION FOR SEQ ID NO: 1917 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1917 
CGTGGGTCTG GCGGTATCAT 



2) INFORMATION FOR SEQ ID NO: 1918 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1918 
GTGGGTCTCA CGGTATCATT G 
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2) INFORMATION FOR SEQ ID NO: 1919 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1919 
CGTGGGTCTC TCGGTATCAT T 



2) INFORMATION FOR SEQ ID NO: 1920 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1920 
CGTGGNTCTC GCGGTATCAT 



2) INFORMATION FOR SEQ ID NO: 1921 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1921 
CGTGGGTCTA GCGGTATCAT T 



2) INFORMATION FOR SEQ ID NO: 1922 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1922 
GTTTTCCAAT GATTAGCACT TTTA 



2) INFORMATION FOR SEQ ID NO: 1923 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1923 
GTTTTCCAAT GATAAGCACT TTTA 



2) INFORMATION FOR SEQ ID NO: 1924 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1924 
GTTTTCCAAT GCTGAGCACT TTT 



2) INFORMATION FOR SEQ ID NO: 1925 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1925 
CGTTTTCCAA TGATGAGCAC TTT 
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2) INFORMATION FOR SEQ ID NO: 1926 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single ' 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1926 
GTTTTCCAAT GGTGAGCACT TTT 



2) INFORMATION FOR SEQ ID NO: 1927 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria meningitidis 

(B) STRAIN: MC9690-129 

(C) ACCESSION NUMBER: AF126482 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1927 

ATGAGTATTC AACATTTTCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGTGC GGTATTATCC CGTGTTGACG 250 

CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCTGCCA 4 00 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT 500 

GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 700 
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GAGCCGGTGA GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT 
GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 
TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 
AGCATTGGTA A 



750 
800 
850 
861 
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30 



35 



40 



45 



2) INFORMATION FOR SEQ ID NO: 1928 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 61 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: HB251 

(C) ACCESSION NUMBER: X57 972 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1928 

ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAG1AAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC CGTGTTGACG 250 

CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTAAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 4 00 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACTCGCCTT GATCATTGGG AACCGGAGCT 500 

GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 7 00 

GAGCCGGTGA GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT 7 50 

GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 800 

TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 850 

AGCATTGGTA A 861 



2) INFORMATION FOR SEQ ID NO: 1929 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 808 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella oxytoca 
5 (B) STRAIN: 26W 

(C) ACCESSION NUMBER: U09188 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1929 



10 ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAGCT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 
GAGCACTTTT AAAGTTCTGC TATGTGGTGC GGTATTATCC CGTGTTGACG 1 250 

15 CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 400 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACCCGCCTT GATAGTTGGG AACCGGAGCT 500 

20 GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 700 

GAGCCGGTAA GCGTGGATCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT 750 

25 GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 800 

TATGGATG 808 



30 2) INFORMATION FOR SEQ ID NO: 1930 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 
35 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



40 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: AF190692 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1930 

45 

ATGAGTATTC AACATTTTCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

50 GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC CGTGTTGACG 250 

CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCTGCCA 4 00 
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ACTTACTTCT 
CACAACATGG 
GAATGAAGCC 
TGGCAACAAC 
TCCCGGCAAC 
ACTTCTGCGC 
GAGCCGGTGA 
GGTAAGCCCT 
TATGGATGAA 
AGCATTGGTA 



GACAACGATC 
GGGATCATGT 
ATACCAAACG 
GTTGCGCAAA 
AATTAATAGA 
TCGGCCCTTC 
GCGTGGGTCT 
CCCGTATCGT 
CGAAATAGAC 
A 



GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 
AACTCGCCTT GATCGTTGGG AACCGGAGCT 
ACGAGCGTGA CACCACGATG CCTGCAGCAA 
CTATTAACTG GCGAACTACT TACTCTAGCT 
CTGGATGGAG GCGGATAAAG TTGCAGGACC 
CGGCTGGCTG GTTTATTGCT GATAAATCTG 
GGCGGTATCA TTGCAGCACT GGGGCCAGAT 
AGTTATCTAC ACGACGGGGA GTCAGGCAAC 
AGATCGCTGA GATAGGTGCC TCACTGATTA 



450 
500 
550 
600 
650 
700 
750 
800 
850 
861 
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2) INFORMATION FOR SEQ ID NO: 1931 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: BM2728 

(C) ACCESSION NUMBER: AF104442 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1931 

ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGCGC GGTATTATCC CGTGTTGACG 250 

CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 4 00 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 4 50 

CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT 500 

GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 700 

GAGCCGGTGA GCGTGGGTCT CTCGGTATCA TTGCAGCACT GGGGCCAGAT 750 

GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 800 

TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 850 

AGCATTGGTA A 861 



2) INFORMATION FOR SEQ ID NO: 1932 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: . 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 
L0 (A) ORGANISM: Escherichia coli 

(C) ACCESSION NUMBER: AF190695 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1932 

15 ATGAGTATTC AACATTTTCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGCT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGTGC GGTATTATCC CGTGTTGACG 250 

20 CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCTGCCA 400 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT 500 

25 GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 700 

GAGCCGGTGA GCGTGGGTCT AGCGGTATCA TTGCAGCACT GGGGCCAGAT 750 

3 0 GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 80.0 

TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 850 

AGCATTGGTA A 861 



35 



45 



2) INFORMATION FOR SEQ ID NO: 1933 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 861 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: AF190693 

50 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1933 

ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 
TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 
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CTGAAGATCA GTTGGGTGCA CGAGTGGGTT 
AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC 
GAGCACTTTT AAAGTTCTGC TATGTGGTGC 
CCGGGCAAGA GCAACTCGGT CGCCGCATAC 
5 GTTGAGTACT CACCAGTCAC AGAAAAGCAT 
AAGAGAATTA TGCAGTGCTG CCATAACCAT 
ACTTACTTCT GACAACGATC GGAGGACCGA 
CACAACATGG GGGATCATGT AACCCGCCTT 
GAATGAAGCC ATACCAAACG ACGAGCGTGA 

10 TGGCAACAAC GTTGCGCAAA CTATTAACTG 
TCCCGGCAAC AATTAATAGA CTGGATGGAG 
ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG 
GAGCCGGTGA GCGTGGATCT CGCGGTATCA 
GGTAAGCCCT CCCGTATCGT AGTTATCTAC 

15 TATGGATGAA CGAGATAGAC AGATCGCTGA 
AGCATTGGTA A 



ACATCGAGCT 

a x X A> \^\^A X \J Am 


GGATCTCAAC 


150 


GAAGAACGTT 

\ mt J A 11 1VJ1 AmA 1^^^* Aw ^ 


TTCCAATGGT 


200 


GGTATTATCC 

A* A. A A. Am A A A± 


CGTGTTGACG 


250 


ACTATTCTCA 


GAATGACTTG 


300 


PTTACGGATG 


GCATGACAGT 


350 


GAGTGATAAC 


ACTGCGGPCA 


400 


AGGAGPTAAP 


PQPTTTTTTG 

VJ V-» X X X X X X W 


450 


GATCGTCGGG 


AAPPGGAGPT 


500 


PACCACGATG 


PPTfJPAGPAA 


550 
*j -j \j 


PPPAAPTAPT 


TAPTPTAPPT 


\J\J\J 


(jCbts A 1 AAACj 




/•en 
ODU 


GTTTATTGCT 


GATAAATCTG 


700 


TTGCAGCACT 


GGGGCCAGAT 


750 


ACGACGGGGA 


GTCAGGCAAC 


800 


GATAGGTGCC 


TCACTGATTA 


850 






861 



20 2) INFORMATION FOR SEQ ID NO: 1934 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 
25 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1934 

GAACGCCAGC GCGAAATTCA AAAAG 25 



35 



45 



2) INFORMATION FOR SEQ ID NO: 1935 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 bases 
40 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1935 
AGCTCGGCAT ACTTCGACAG G 21 

50 

2) INFORMATION FOR SEQ ID NO: 1936 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY-: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1936 
TACCACCCGC ACGGC 



PCT/CA00/01150 



15 



15 2) INFORMATION FOR SEQ ID NO: 1937 



20 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



25 



(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1937 



45 



CGGAGTCGCC GTCGATG 



17 



30 



35 



40 



2) INFORMATION FOR SEQ ID NO: 1938 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1938 
CCGCGCACCA TTGCTTCGTA CACTGAGGAG TCTCCGCGCG G 



41 



2) INFORMATION FOR SEQ ID NO: 1939 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 4 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1939 
CGACCCGGAT GGTAGTATCG ATAATGATCC GCCAGCGGCC GGGTCG 



2) INFORMATION FOR SEQ ID NO: 1940 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1940 
GTATCGTTGG TGACGTAAT 



2) INFORMATION FOR SEQ ID NO: 1941 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1941 
GCAATGGTCC GTTTAAGT 



2) INFORMATION FOR SEQ ID NO: 1942 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1942 
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5 2) INFORMATION FOR SEQ ID NO: 1943 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 bases 

(B) TYPE: Nucleic acid 
LO (C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
L5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1943 

- GCTGGTGGAC GGCCAG 16 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1944 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1944 
TTTCGCCGCC ATGCGTTAC 19 



2) INFORMATION FOR SEQ ID NO: 1945 



(i) SEQUENCE CHARACTERISTICS : 
^0 (A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1945 
CGGCGACTAC GCGGTAT 17 

50 

2) INFORMATION FOR SEQ ID NO: 1946 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1946 
CGGCGACTTC GCGGTAT 



2) INFORMATION FOR SEQ ID NO: 1947 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1947 
CGGTATACGG CACCATCGT 



2) INFORMATION FOR SEQ ID NO: 1948 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1948 
GCGGTATACA ACACCATCG 



2) INFORMATION FOR SEQ ID NO: 1949 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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50 



WO 01/23604 PCT/CA00/01150 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1949 
CGGTATACGC CACCATCGT 19 



2) INFORMATION FOR SEQ ID NO: 1950 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 15 bases 
15 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1950 
GGCGACATCG CCTGC 15 



2) INFORMATION FOR SEQ ID NO: 1951 



(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1951 
GGCGACAGAG CCTGCTA 17 

40 



2) INFORMATION FOR SEQ ID NO: 1952 

45 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1952 
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CCTGCTATGG AGCGATGGT 19 



2) INFORMATION FOR SEQ ID NO: 1953 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 bases 
L0 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



L5 



10 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1953 
CGCCTGCTAT AAAGCGATGG T 21 



2) INFORMATION FOR SEQ ID NO: 1954 



(i) SEQUENCE CHARACTERISTICS: 
>5 (A) LENGTH: 589 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

10 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

15 <C) ACCESSION NUMBER: AF052258 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1954 

ACACCGGTCA ACATTGAGGA AGAGCTTAAG AACTCTTATC TGGATTATGC 50 

10 GATGTCGGTC ATTGTTGGCC GTGCGCTGCC GGATGTCCGA GATGGCCTGA 100 

AGCCGGTACA CCGTCGCGTA CTTTACGCCA TGAACGTATT GGGCAATGAC 150 

TGGAACAAAG CCTATAAAAA ATCAGCCCGT GTCGTTGGTG ACGTAATCGG 200 

TAAATACCAC CCGCACGGCG ACTCCGCGGT ATACGACACC ATCGTGCGTA 250 

TGGCGCAGCC GTTCTCGCTG CGTTACATGC TGGTGGACGG CCAGGGTAAC 300 

[S TTTGGTTCCA TCGACGGCGA CTCCGCCGCG GCGATGCGTT ATACCGAAAT 350 

TCGTCTGGCG AAAATCGCTC ATGAGCTGAT GGCCGATCTT GAAAAAGAGA 400 

CGGTCGATTT CGTCGACAAC TATGACGGTA CGGAGCGTAT TCCGGACGTC 4 50 

ATGCCGACCA AAATTCCTAA CCTGCTGGTG AACGGCGCCT CCGGGATCGC 500 

CGTAGGGATG GCCACCAACA TACCGCCACA TAACCTGACG GAAGTGATTA 550 

JO ACGGCTGTCT GGCGTATGTT GACGATGAAG ACATCAGCA 589 
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2) INFORMATION FOR SEQ ID NO: 1955 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1955 
CGCGCAACCA TTGCTTCGTA CACTGAGGAG TCTGCGCG 



2) INFORMATION FOR SEQ ID NO: 1956 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 989 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Candida inconspicua 

(B) STRAIN: ATCC 16783 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1956 

CGGTAAAACC ACTTTAACTG CTGCAATCAC CAAAGTTTTA TCTGAAGAAG 50 

GTGGTGCAGA TTTCTTAGAT TATTCATCTA TTGATAAAGC ACCAGAAGAG 100 

AGAGCTAGAG GTATTACCAT TTCTACTGCT CATGTTGAAT ATGAAACTCC 150 

AAACAGACAT TATTCACATG TTGATTGTCC AGGTCATCAA GATTATATTA 200 

AGAACATGAT TACTGGTGCA GCTCAAATGG ATGGTGCTAT TATTGTTGTT 250 

GCAGCTACTG ATGGTCAAAT GCCACAAACT AAGGAACATT TATTATTAGC 300 

TAGACAAGTT GGTGTTCAAC ATTTAGTTGT TTTTGTTAAC AAGTGTGATA 350 

CCGTTGATGA TCCAGAAATG TTRGAATTAG TTGAAATGGA AATGAGAGAA 400 

TTATTGACCG AATATGGATT YGATGGTGAT AACACACCAG TTATTATGGG 450 

TTCTGCATTA ATGGCATTAG AAGGTAAGAG ACCAGAAGTT GGTAAAGAAT 500 

CTATTGTTAA GTTAATGGAA GCTGTTGATA CTTGGATTCC AACTCCACAA 550 

AGAGACTTAG AAAAACCATT CTTATTACCA ATTGATGAAG TTTTCTCTAT 600 

TTCTGGTAGA GGTACTGTTG TTTCTGGTAC TGTTGATAGA GGTACATTAA 650 

AGAAGGGTGA AGAAGTTGAA ATTGTTGGTG GTAAAGAAGG TGTTATTAAG 700 

ACTACTGTTA CCGGTATTGA AATGTATCAC AAGGAATTAG ATCAAGCACA 750 

AGCTGGTGAT ACTCCAGGTA TTTTGTTAAG AGGTGTTAAG AGAGATCAAA 800 

TTGCAAGAGG TCAAATTCTT GCAAAGCCAG GWTCTGTTAA GGCATACAAG 850 

AAGTTCTTAT CATCATTATA CATTTTAACA AAGGAAGAAG GTGGTAGACA 900 

TACTCCATTT TCTGAAAATT ACAGACCTCA AATGTACATT AGAACTTCCA 950 

ATGTTAATGT TACTTTGAAG TTCCCAGAAA CTGAAGAAG 989 
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PCT/CA00/0115O 



LO 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 991 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

<ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida utilis 
'(B) STRAIN: ATCC 22023 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1957 

GGTAAGACCA CCCTTACTGC CGCCATCACC AAGTGCCTTG CTGAGAAGGG 50 

AGGTGCCTCG TTCTTGGACT ACAGTGCCAT CGACAAGGCG CCAGAGGAGA 100 

GAGCAAGAGG TATCACCATC TCCACTGCGC ACGTTGAGTA TGAAACTGCC 150 

20 AACAGACACT ACTCGCACGT TGACTGTCCA GGTCACGCTG ATTACATCAA 200 

GAACATGATT ACCGGTGCTG CGCAGATGGA CGGTGCTATC ATTGTCGTTG 250 

CAGCCACTGA CGGTCAGATG CCACAGACCA GAGAACACTT GTTGCTTGCC 300 

AGACAAGTTG GTGTCCAGCA CATTGTTGTC TTCGTCAACA AGGTTGACAC 350 

CATCGACGAC CCTGAGATGC TTGAGCTTGT TGAAATGGAG ATGAGAGAGT 400 

25 TGCTTACTTC GTATGGATTT GACGGTGATA ACACCCCAGT TATCATGGGT 450 

TCTGCTTTGT GTGCTTTGGA AGGCCGTGAG CCAGAGATTG GTGCTAAGGC 500 

CATTGACAAG TTGATGGAGG CCATTGATGA GTACATCCCA ACTCCTCAGA 550 

GAGACCTGGA AAAGCCATTC YTGATGGGTG TTGAAGACGT GTTCTCGATC 600 

TCTGGTAGAG GTACCGTTGT CACRGGCCGT GTTGAGCGTG GTAACTTGAA 650 

30 GAAAGGTGAT GAAATTGAAC TTGTTGGCTA CAACAAGAAC CCAATCAAGA 700 

CCACCGTCAC CGGTATCGAA ATGTTCAAGA AGGAGTTGGA ATCTGCCATG 750 

GCTGGTGACA ACTGTGGTAT CTTGTTGCGT GGTATCAAGA GAGATGACGT 800 

CAAGAGAGGT ATGGTTGCTG CTAAGCCAGG CTCCGTCTCT GCACACACCA 850 

AGTTCCTCGC TTCCTTGTAC ATCCTGACRA AGGAGGAAGG TGGTCGTCAC 900 

35 AGTGCCTTTG CTGAGAACTA CAGACCACAG ATGTTCATCA GAACCGGAGA 950 

TGTCACCACC ATCTTGACAT GGCCAGAGGA GCACGCTGAC C 991 



40 2) INFORMATION FOR SEQ ID NO: 1958 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 985 bases 

(B) TYPE: Nucleic acid 
45 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

50 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida zeylanoides 

(B) STRAIN: ATCC 7351 
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(xi) SEQUENCE DESCRIPTION: SEQ 

CGGTAAGACC ACTTTGACCG CCGCCATCAC 
GTGGTGCTTC CTTCTTGGAC TACGGGTCCA 
AGAGCCAGAG GTATTACTAT CTCGACTGCC 
TAAGAGACAC TACGCCCACG TTGATTGCCC 
AGAACATGAT CACTGGTGCC GCCCAAATGG 
GCTGCTTCTG ATGGCCAAAT GCCGCAGACC 
CAGACAGGTT GGTGTGCAGA ACTTGGTTGT 
CCATCGACGA CCCCGAAATG TTGGAGTTGG 
TTGTTGACCC ACTACGGCTT TGACGGTGAC 
TTCGGCGTTG TGTGCCTTGG AAGACAGGCA 
CCATCATGAA GTTGTTGGAC GCTGTCGACG 
AGAGACTTGG AGCAACCATT TTTGATGCCC 
CTCTGGCAGA GGTACTGTTG TCACCGGTCG 
AGAAGGGTGA GGAGATTGAG ATTGTTGGCG 
ACTACCGTCA CCGGCATTGA GATGTTCAAG 
GGCGGGCGAC AACGCCGGGA TCTTGTTGAG 
TCTCGAGAGG TATGGTTTTG GCCAAGCCCG 
AAGGTGTTGG CGTCGCTTTA CATCTTGACC 
CTCGCCCTTT GGTGAGAACT ACAAGCCACA 
ATGTCACTGG TACTTTGAGG TTCCCCGCCG 



PCT/CA00/01150 

ID NO: 1958 

CAAGGTGTTG AGCGCCAAAG 50 

TCGACAGAGC CCCTGAGGAG 100 

CACGTTGAGT ACGAGACCGA 150 

TGGTCACGCT GATTACATCA 200 

ACGGTGCCAT TATTGTCGTT 250 

AGAGAGCACT TGTTGCTTGC 300 

GTTTGTTAAC AAGGTGGACA 350 

TGGAGATGGA AATGAGAGAA 4 00 

AACACCCCTG TCATCATGGG 4 50 

GCCTGAGATT GGCGAGCAAG 500 

AGTACATTCC CACTCCTCAG 550 

GTTGAGGATG TTTTCTCCAT 600 

TGTTGAGAGA GGCTCATTGA 650 

ACTTCCCCAA GCCCTTCAAG 700 

AAGGAGTTGG ATGCCGCGAT 750 

AGGTGTCAAG AGAGACGAGG 800 

GTACTGTCAC TTCGCACACC 850 

AAAGAGGAAG GTGGCCGCCA 900 

GTTATTCATC AGAACCTCCG 950 

GTGAG 985 



2) INFORMATION FOR SEQ ID NO: 1959 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 3 bases . 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

r 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL" SOURCE: 

(A) ORGANISM: Candida catenulata 

(B) STRAIN: ATCC 10565 

(C) ACCESSION NUMBER: 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1959 

CGGTAAGACC ACCTTGACTG CCGCCATCAC CAAGGTTCTC TCCGAGAAGG 50 

GTGGTGCCGA CTTCTTGGAC TACGGTGCCA TTGACAGAGC CCCCGAGGAG 100 

CGTGCCCGTG GTATCACCAT CTCCACTGCC CACGTTGAGT ACGAGACTGA 150 

CAACCGTCAC TACGCCCACA TTGACTGTCC CGGTCACGCT GATTACATCA 200 

AGAACATGAT TACCGGTGCC GCCCAGATGG ACGGTGCCAT TATTGTCCTT 250 

GCTGCTACTG ACGGTGCCAT GCCCCAGACC CGCGAGCACT TGCTTCTCGC 300 

CCGTCAGGTT GGTATCCAGG AATTGGTTGT GTTTGTGAAC AAGGTTGACA 350 

CCATCGACGA CCCCGAGATG TTGGAGCTCG TTGAGATGGA GATCCGCGAG 400 

TTGTTGTCTG AGTTCGGTTT TGACGGTGAC AACACCCCCG TCATCATGGG 450 

TTCCGCTTTG TGCGCTTTGG AGGGCAAGCA GCCCGAGATT GGTGAGCAGG 500 

CTATCACCAA GTTGATGGCC GCCGTTGACG AGCACATCCC CACCCCCCAG 550 
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CGTGACTTGG AGCAGCCTTT 
CTCTGGCCGT GGTACCGTGG 
AGAAGGGTGA GGAGATTGAG 
GTGACTGTTA CTGGTATTGA 
5 GGCTGGTGAC AACGCCGGTA 
TGTCTCGTGG TATGGTTTTG 
AAGGTTTTGG CTTCGCTTTA 
GACCGGCTTC GGCTCCAACT 
ACGTCACTGG TACCCTCACC 



CTTGATGCCT GTTGAGGGTG 
TGACTGGTAA GGTTGCCCGT 
ATTGTTGGCA ACTTTGACAA 
GATGTTCAAG AAGGAGTTGG 
TCTTGTTGCG TGGTGTCAAG 
GCCAAGCCCG GCACTGTTGT 
CATCTTGACC CAGGAGGAGG 
ACAAGCCCCA GTTGTTCTTG 
TTC 



TTTTCTCTAT 600 

GGTGTCCTCA 650 

GCCCTACAAG 700 

ACCAGGCCAT 750 

CGTGACGAGG 800 

CTCGCACAAG 850 

GTGGCCGTAA 900 

CGCACTACCG 950 

973 



2) INFORMATION FOR SEQ ID NO: 1960 

L5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 985 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



10 



;o 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida krusei 
15 (B) STRAIN: ATCC 28870 

(xi) SEQUENCE DESCRIPTION: SEQ ID. NO: 1960 

AAGACTACCT TGACTGCTGC AATCACCAAG GTCTTAGCTG ATCAAGGTGG 50 

10 TGCTGATTTC TTAGATTATG CATCTATTGA CAAGGCTCCT GAAGAAAGAG 100 

CAAGAGGTAT TACTATCTCT ACTGCTCACG TTGAGTATGA AACCCCAAAC 150 

AGACATTATT CTCATGTCGA TTGTCCTGGC CATCAAGATT ATATTAAGAA 200 

TATGATTACT GGTGCTGCAC AAATGGATGG TGCTATTATT GTTGTTGCTG 250 

CTACTGATGG TCAAATGCCA CAAACTAAGG AACATTTATT ATTAGCAAGA 300 

)5 CAAGTTGGTG TTCAACATTT AGTTGTCTTT GTTAATAAAT GTGACACCAT 350 

TGATGACCCA GAAATGTTGG AATTAGTTGA AATGGAAATG AGAGAACTAT 400 

TGTCTGAATA TGGTTTTGAT GGTGATAACA CTCCAGTTAT TATGGGTTCT 450 

GCATTGATGG CTTTAGAAGA CAAGAGACCT GAAGTTGGTA AGGAATCTAT 500 

TTTAAAGTTA ATGGAAGCYG TTGACACATG GATTCCAACC CCAGAGAGAG 550 

10 ATTTAGAAAA ACCATTTTTG TTACCTATTG ATGAAGTTTT CTCAATCTCT 600 

GGTAGAGGTA CTGTCGTTTC TGGTACTGTC GAAAGAGGTA CTTTGAAGAA 650 

GGGTGAAGAA GTTGAAATTG TTGGTGGTAA GGATGGTTCT ATTAAAACTA 700 

CTGTCACAGG TATTGAAATG TATCACAAGG AATTAGACCA AGCGCAAGCA 750 

GGTGATACTC CAGGTATTTT ATTAAGAGGT GTCAAGAGAG ACCAAATCAA 800 

[5 GAGAGGTCAA ATTTTAGCAA AGCCAGATTC CGTTAAGGCA TACAAGAAGT 850 

TCTTGGCTTC CCTTTATATC TTAACCAAGG AAGAAGGTGG TAGACATACA 900 

CCATTCTCTG AAAACTACAG ACCACAAATG TACATCAGAA CTACCAATGT 950 

TAACGTTACT TTGAAGTTCC CAGACACTGA AGAAG 985 



2) INFORMATION FOR SEQ ID NO: 1961 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1961 
GCTCAAGGCA GATGGCATTC CC 



2) INFORMATION FOR SEQ ID NO: 1962 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1962 
GGACAAGGCG GTTGCGTTTG AT 



2) INFORMATION FOR SEQ ID NO: 1963 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1963 
CATTCCCGTC TCGCTCGACA GT 



2) INFORMATION FOR SEQ ID NO: 1964 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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5 



25 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1964 
ATCTGCCTGC CCGTCTTGC 19 



L0 2) INFORMATION FOR SEQ ID NO: 1965 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 bases 

(B) TYPE: Nucleic acid 

15 (C) STRANDEDNESS: Double ( 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Plasmid pGSOS 

(C) ACCESSION NUMBER: M36657 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1965 



ATGAATAAAT CGCTCATCAT TTTCGGCATC GTCAACATAA CCTCGGACAG 50 

TTTCTCCGAT GGAGGCCGGT ATCTGGCGCC AGACGCAGCC ATTGCGCAGG 100 

CGCGTAAGCT GATGGCCGAG GGGGCAGATG TGATCGACCT CGGTCCGGCA 150 
TCCAGCAATC CCGACGCCGC GCCTGTTTCG TCCGACACAG AAATCGCGCG * 200 

30 TATCGCGCCG GTGCTGGACG CGCTCAAGGC AGATGGCATT CCCGTCTCGC 250 

TCGACAGTTA TCAACCCGCG ACGCAAGCCT ATGCCTTGTC GCGTGGTGTG 300 

GCCTATCTCA ATGATATTCG CGGTTTTCCA GACGCTGCGT TCTATCCGCA 350 

ATTGGCGAAA TCATCTGCCA AACTCGTCGT TATGCATTCG GTGCAAGACG 400 

GGCAGGCAGA TCGGCGCGAG GCACCCGCTG GCGACATCAT GGATCACATT 450 

35 GCGGCGTTCT TTGACGCGCG CATCGCGGCG CTGACGGGTG CCGGTATCAA 500 

ACGCAACCGC CTTGTCCTTG ATCCCGGCAT GGGGTTTTTT CTGGGGGCTG 550 

CTCCCGAAAC CTCGCTCTCG GTGCTGGCGC GGTTCGATGA ATTGCGGCTG 600 

CGCTTCGATT TGCCGGTGCT TCTGTCTGTT TCGCGCAAAT CCTTTCTGCG 650 

CGCGCTCACA GGCCGTGGTC CGGGGGATGT CGGGGCCGCG ACACTCGCTG 700 

10 CAGAGCTTGC CGCCGCCGCA GGTGGAGCTG ACTTCATCCG CACACACGAG 750 

CCGCGCCCCT TGCGCGACGG GCTGGCGGTA TTGGCGGCGC TGAAAGAAAC 800 

CGCAAGAATT CGTTAA 816 



15 

2) INFORMATION FOR SEQ ID NO: 1966 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 bases 
50 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1966 
5 CATGCCAGTC TTGCCAACG 19 



2) INFORMATION FOR SEQ ID NO: 1967 



L0 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
L5 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1967 

20 

CAGCAATAAG TAATCCAGCG ATG 23 



25 2) INFORMATION FOR SEQ ID NO: 1968 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) . TYPE: Nucleic acid 
30 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1968 

GGAGAGATTT CACCGCATAG 20 



40 



30 



2) INFORMATION FOR SEQ ID NO: 1969 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 22 bases 
45 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1969 

AGCCAACCAT CATGCTATTC CA 22 
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2) INFORMATION FOR SEQ ID NO: 1970 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1206 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Transposon TnlO 
• (C) ACCESSION NUMBER: J01830 

L5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1970 

ATGAATAGTT CGACAAAGAT CGCATTGGTA ATTACGTTAC TCGATGCCAT 50 

GGGGATTGGC CTTATCATGC CAGTCTTGCC AACGTTATTA CGTGAATTTA 100 

TTGCTTCGGA AGATATCGCT AACCACTTTG GCGTATTGCT TGCACTTTAT 150 

>0 GCGTTAATGC AGGTTATCTT TGCTCCTTGG CTTGGAAAAA TGTCTGACCG 200 

ATTTGGTCGG CGCCCAGTGC TGTTGTTGTC ATTAATAGGC GCATCGCTGG . 250 

ATTACTTATT GCTGGCTTTT TCAAGTGCGC TTTGGATGCT GTATTTAGGC 300 

CGTTTGCTTT CAGGGATCAC AGGAGCTACT GGGGCTGTCG CGGCATCGGT 350 

CATTGCCGAT ACCACCTCAG CTTCTCAACG CGTGAAGTGG TTCGGTTGGT 400 

IS TAGGGGCAAG TTTTGGGCTT GGTTTAATAG CGGGGCCTAT TATTGGTGGT 450 

TTTGCAGGAG AGATTTCACC GCATAGTCCC TTTTTTATCG CTGCGTTGCT .500 

AAATATTGTC ACTTTCCTTG TGGTTATGTT TTGGTTCCGT GAAACCAAAA 550 

ATACACGTGA TAATACAGAT ACCGAAGTAG GGGTTGAGAC GCAATCGAAT 600 

TCGGTATACA TCACTTTATT TAAAACGATG CCCATTTTGT TGATTATTTA 650 

JO TTTTTCAGCG CAATTGATAG GCCAAATTCC CGCAACGGTG TGGGTGCTAT 700 

TTACCGAAAA TCGTTTTGGA TGGAATAGCA TGATGGTTGG CTTTTCATTA 750 

GCGGGTCTTG GTCTTTTACA CTCAGTATTC CAAGCCTTTG TGGCAGGAAG 800 

AATAGCCACT AAATGGGGCG AAAAAACGGC AGTACTGCTC GAATTTATTG 850 

CAGATAGTAG TGCATTTGCC TTTTTAGCGT TTATATCTGA AGGTTGGTTA 900 

IS GATTTCCCTG TTTTAATTTT ATTGGCTGGT GGTGGGATCG CTTTACCTGC 950 

ATTACAGGGA GTGATGTCTA TCCAAACAAA GAGTCATGAG CAAGGTGCTT 1000 

TACAGGGATT ATTGGTGAGC CTTACCAATG CAACCGGTGT TATTGGCCCA 1050 

TTACTGTTTA CTGTTATTTA TAATCATTCA CTACCAATTT GGGATGGCTG 1100 

GATTTGGATT ATTGGTTTAG CGTTTTACTG TAT TAT TAT C CTGCTATCGA 1150 

tO TGACCTTCAT GTTAACCCCT CAAGCTCAGG GGAGTAAACA GGAGACAAGT 1200 

GCTTAG 1206 



L5 2) INFORMATION FOR SEQ ID NO: 1971 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

50 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1971 
CYGACTGYGC CATCCTYATC A 21 

2) INFORMATION FOR SEQ ID NO: 1972 



(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1972 
MGICAGCTCA TYITTGCWKS C 21 

20 



2) INFORMATION FOR SEQ ID NO: 1973 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1973 
35 RACACCRGIY TTGGWITCCT T 21 



2) INFORMATION FOR SEQ ID NO: 1974 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1974 

50 

ACAAGGGITG GRMSAAGGAG AC 22 
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2) INFORMATION FOR SEQ ID NO: 1975 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 21 bases 
5 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



L0 



L5 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1975 
TGRCCRGGGT GGTTRAGGAC G 21 



2) INFORMATION FOR SEQ ID NO: 1976 



(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1976 - 
GATGGAYTCY GTYAAITGGG A 21 

30 



2) INFORMATION FOR SEQ ID NO: 1977 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

10 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1977 
15 GATGGAYTCY GTYAARTGGG A 21 



2) INFORMATION FOR SEQ ID NO: 1978 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1978 
CATCITGYAA TGGYAATCTY AAT 23 



2) INFORMATION FOR SEQ ID NO: 1979 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1979 
CATCYTGYAA TGGYAASCTY AAT 23 

25 

> » 
2) INFORMATION FOR SEQ ID NO: 1980 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1980 
40 TCRATGGCIT CIAIRAGRGT YT 22 



2) INFORMATION FOR SEQ ID NO: 1981 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1981 
TGGACACCIS CAAGIGGKCY G 



PCT/CAOO/01150 



21 



2) INFORMATION FOR SEQ ID NO: 1982 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1982 
TGGACACYIS CAAGIGGKCY G 21 

20 



2) INFORMATION FOR SEQ ID NO: 1983 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

30 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1983 
35 CYGAYTGCGC YATICTCATC A 21 



40 



2) INFORMATION FOR SEQ ID NO: 1984 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
45 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1984 

50 

CYGAYTGYGC YATYCTSATC A 21 



1028 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CAOO/01150 



LO 



L5 



2) INFORMATION FOR SEQ ID NO: 1985 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

<ii) MOLECULE TYPE: Genomic DNA 

(vi)- ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: Ml-106 

(C) ACCESSION NUMBER: U81804 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1985 



ATGGGTAAGG ACAAGCTGCA CGTCAACGTC GTTGTTATCG GTCACGTCGA 50 

CTCCGGTAAG TCGACCACCA CCGGTCACTT GATCTACAAG TGCGGTGGTA 100 

20 TCGACAAGCG AACCATTGAG AAGTTCGAGA AGGAGGCTCA AGAGCTCGGA 150 

AAGTCTTCTT TCAAGTACGC TTGGGTTCTT GACAAGCTTA AGGCCGAGCG 200 

AGAGCGAGGT ATCACCATCG ACATTGCTCT TTGGAAGTTC GAGACCCCTA 250 

AGTACCAGGT TACCGTCATT GACGCCCCCG GTCACCGAGA CTTCATCAAG 300 

AACATGATCA CCGGTACCTC CCAGGCTGAC TGTGCCATCC TCATCATTGC 350 

25 CACCGGTATC GGTGAGTTCG AGGCTGGTAT CTCCAAGGAC GGTCAGACCC 400 

GAGAGCACGC CCTCCTCGCC TTCACCCTCG GTGTCAGGCA GCTCATTGTT 4 50 

GCTTGCAACA AGATGGACAC CTGCAAGTGG TCTGAGGACC GATTCAACGA 500 

AATCGTCAAG GAGACCAACG GTTTCATCAA GAAGGTTGGT TACAACCCCA 550 

AGGCTGTCCC CTTCGTCCCC ATCTCTGGTT GGCACGGTGA CAACATGTTG 600 

30 GAGGAGACCA CCAACATGCC CTGGTACAAG GGATGGACCA AGGAGACCAA 650 

GTCCGGTGTT TCCAAGGGTA AGACCCTTCT CGAGGCCATC GACGCCAGTA 700 

GGCCCCCTAC CCGACCCACC GACAAGCCCC TCCGTCTCCC TCTCCAGGAC 750 

GTCTACAAGA TCGGTGGTAT CGGCACAGTC CCTGTCGGCC GAGTCGAGAC 800 

CGGTGTCATC AAGGCCGGTA TGGTCGTCAA GTTCGCCCCC ACCAACGTCA 850 

3.5 CCACTGAAGT CAAGTCCGTT GAGATGCACC ACGAGCAGAT CCCCGAGGGT 900 

CTCCCCGGAG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCATCAAGGA 950 

CATCCGACGA GGTAACGTCT GTGGTGACTC CAAGAACGAC CCCCCTATGG 1000 

AGGCTGCTTC TTTCAACGCC CAGGTTATCG TCCTTAACCA CCCTGGTCAG 1050 

ATCGGTGCCG GTTACACCCC CGTTCTCGAC TGTCACACTG CCCACATTGC 1100 

10 TTGCAAGTTC TCTGAGTTGA TCGAGAAGAT TGACCGACGA ACCGGTAAGG 1150 

TCATGGAGGC CGCCCCCAAG TTCGTCAAGT CTGGTGACGC CGCCATTGTC 1200 

AAGCTTGTTT CCCAGAAGCC TCTCTGTGTT GAGACCTACG CCGACTACCC 1250 

CCCTCTTGGT CGATTCGCCG TCCGAGACAT GCGACAGACC GTTGCCGTTG 1300 

GTGTTATTAA GAGTGTGGAG AAGTCCGATG GGAAGAGCGG CAAGGTTACC 1350 

15 AAGGCCGCCG AGAAGGCTGC TAAGAAGAAG TAA 1383 



50 



2) INFORMATION FOR SEQ ID NO: 1986 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1380 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Cryptococcus neoformans 

(B) STRAIN: B3501 

(C) ACCESSION NUMBER: U81803 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1986 



ATGGGTAAGG ACAAGCTGCA CGTCAACGTC GTTGTTATCG GTCACGTCGA 50 

CTCCGGTAAG TCGACCACCA CCGGTCACTT GATCTACAAG TGCGGTGGTA 100 

L5 TCGACAAGCG AACCATTGAG AAGTTCGAGA AGGAGGCTCA AGAGCTCGGA 150 

AAGTCTTCTT TCAAGTACGC TTGGGTTCTT GACAAGCTTA AGGCCGAGCG 200 

AGAGCGAGGT ATCACCATCG ACATTGCTCT TTGGAAGTTC GAGACCCCCA . 250 

GGTACCAGGT CACCGTCATT GACGCCCCCG GTCACCGAGA CTTCATCAAG 300 

AACATGATCA CCGGTACCTC CCAGGCTGAC TGTGCCATCC TCATCATTGC 350 

20 CACCGGTATC GGTGAGTTCG AGGCCGGTAT CTCCAAGGAC GGTCAGACCC 4 00 

GAGAGCACGC CCTCCTCGCC TTCACCCTCG GTGTCAGGCA GCTCATTGTT 450 

GCTTGCAACA AGATGGACAC CTGCAAGTGG TCCGAGGACC GATTCAACGA 500 

AATCGTCAAG GAGACCAACG GTTTCATCAA GAAGGTTGGC TACAACCCCA 550 

AGGCTGTCCC CTTCGTCCCC ATCTCTGGTT GGCACGGTGA CAACATGTTG 600 

25 GAGGAGACCA CCAACATGCC CTGGTACAAG GGATGGACCA AGGAGACCAA 650 

GTCTGGTGTT TCCAGGGGTA AGACCCTTCT CGAGGCCATC AGCGCCAGTA 70.0 

GGCCCCATAC CCGACCCACC GACAAGCCCC TCCGTCTCCC TCTCCAGGAC 750 

GTCTACAAGA TCGGTGGTAT CGGCACAGTC CCTGTCGGCC GAGTCGAGAC 800 

CGGTGTCATC AAGGCCGGTA TGGTCGTCAA GTTCGCCCCC ACCAACGTCA 850 

30 CCACTGAAGT CAAGTCCGTT GAGATGCACC ACGAGCAGAT CCCCGAGGGT 900 

CTTCCCGGAG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCATCAAGGA 950 

CATCCGACGA GGTAACGTCT GTGGTGACTC CAAGAACGAC CCCCCTATGG 1000 

AGGCTGCTTC TTTCAACGCC CAGGTTATCG TCCTTAACCA CCCTGGTCAG 1050 

ATCGGTGCCG GTTACACCCC CGTTCTCGAC TGTCACACTG CCCACATTGC 1100 

35 CTGCAAGTTT GCTGAGTTGA TCGAGAAGAT TGACCGACGA ACCGGTAAGG 1150 

TCATGGAGGC CGCCCCCAAG TTCGTCAAGT CTGGTGACGC CGCCATTGTC 1200 

AAGCTTGTTG CCCAGAAGCC CCTCTGTGTT GAGACCTACG CCGACTACCC 1250 

CCCTCTTGGT CGATTCGCCG TCCGAGACAT GCGACAGACC GTTGCCGTTG 1300 

GTGTTATCAA GAGCGTGGAC AAGACCGAGA AGGGTGGCAA GGTCACCAAG 1350 

40 GCTGCTGAGA AGGCTGCCAA GAAGAAGTAA 1380 



45 



2) INFORMATION FOR SEQ ID NO: 1987 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
50 (D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 
(C) ACCESSION NUMBER: X01638 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1987 

ATGGGTAAAG AGAAGTCTCA CATTAACGTT GTCGTTATCG GTCATGTCGA 50 

TTCTGGTAAG TCTACCACTA CCGGTCATTT GATTTACAAG TGTGGTGGTA 100 

TTGACAAGAG AACCATCGAA AAGTTCGAAA AGGAAGCCGC TGAATTAGGT 150 

10 AAGGGTTCTT TCAAGTACGC TTGGGTTTTG GACAAGTTAA AGGCTGAAAG 200 

AGAAAGAGGT ATCACTATCG ATATTGCTTT GTGGAAGTTC GAAACTCCAA 250 

AGTACCAAGT TACCGTTATT GATGCTCCAG GTCACAGAGA TTTCATCAAG 300 

AACATGATTA CTGGTACTTC TCAAGCTGAC TGTGCTATCT TGATTATTGC 350 

TGGTGGTGTC GGTGAATTCG AAGCCGGTAT CTCTAAGGAT GGTCAAACCA 4 00 

15 GAGAACACGC TTTGTTGGCTV TTCACCTTGG GTGTTAGACA ATTGATTGTT 4 50 

GCTGTCAACA AGATGGACTC CGTCAAATGG GACGAATCCA GATTCCAAGA 500 

AATTGTCAAG GAAACCTCCA ACTTTATCAA GAAGGTTGGT TACAACCCAA 550 

AGACTGTTCC ATTCGTCCCA ATCTCTGGTT GGAACGGTGA CAACATGATT 600 

GAAGCTACCA CCAACGCTCC ATGGTACAAG GGTTGGGAAA AGGAAACCAA 650 

20 GGCCGGTGTC GTCAAGGGTA AGACTTTGTT GGAAGCCATT GACGCCATTG 700 

AACAACCATC TAGACCAACT GACAAGCCAT TGAGATTGCC ATTGCAAGAT 750 

GTTTACAAGA TCGGTGGTAT TGGTACTGTG CCAGTCGGTA GAGTTGAAAC 800 

CGGTGTCATC AAGCCAGGTA TGGTTGTTAC TTTCGCCCCA GCTGGTGTTA 850 

CCACTGAAGT CAAGTCCGTT GAAATGCATC ACGAACAATT GGAACAAGGT 900 

25 GTTCCAGGTG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCGTTAAGGA 950 

AATCAGAAGA GGTAACGTCT GTGGTGACGC TAAGAACGAT CCACCAAAGG 1000 

GTTGCGCTTC TTTCAACGCT ACCGTCATTG TTTTGAACCA TCCAGGTCAA 1050 

ATCTCTGCTG GTTACTCTCC AGTTTTGGAT TGTCACACTG CTCACATTGC 1100 

TTGTAGATTC GACGAATTGT TGGAAAAGAA CGACAGAAGA TCTGGTAAGA 1150 

30 AGTTGGAAGA CCATCCAAAG TTCTTGAAGT CCGGTGACGC TGCTTTGGTC 1200 

AAGTTCGTTC CATCTAAGCC AATGTGTGTT GAAGCTTTCA GTGAATACCC 1250 

ACCATTAGGT AGATTCGCTG TC AG AG AC AT GAGACAAACT GTCGCTGTCG 1300 

GTGTTATCAA GTCTGTTGAC AAGACTGAAA AGGCCGCTAA GGTTACCAAG 1350 

GCTGCTCAAA AGGCTGCTAA GAAATAA 1377 

35 



2) INFORMATION FOR SEQ ID NO: 1988 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



45 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Saccharomyces cerevisiae 
50 (C) ACCESSION NUMBER: M10992 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1988 
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ATGGGTAAAG AGAAGTCTCA CATTAACGTT GTCGTTATCG GTCATGTCGA 50 

TTCTGGTAAG TCTACCACTA CCGGTCATTT GATTTACAAG TGTGGTGGTA 100 

TTGACAAGAG AACCATCGAA AAGTTCGAAA AGGAAGCCGC TGAATTAGGT 150 

AAGGGTTCTT TCAAGTACGC TTGGGTTTTG GACAAGTTAA AGGCTGAAAG 200 

5 AGAAAGAGGT ATCACTATCG ATATTGCTTT GTGGAAGTTC GAAACTCCAA 250 

AGTACCAAGT TACCGTTATT GATGCTCCAG GTCACAGAGA TTTCATCAAG 300 

AACATGATTA CTGGTACTTC TCAAGCTGAC TGTGCTATCT TGATTATTGC 350 

TGGTGGTGTC GGTGAATTCG AAGCCGGTAT CTCTAAGGAT GGTCAAACCA 400 

GAGAACACGC TTTGTTGGCT TTCACCTTGG GTGTTAGACA ATTGATTGTT 450 

10 GCTGTCAACA AGATGGACTC CGTCAAATGG GACGAATCCA GATTCCAAGA 500 

AATTGTCAAG GAAACCTCCA ACTTTATCAA GAAGGTTGGT TACAACCCAA 550 

AGACTGTTCC ATTCGTCCCA ATCTCTGGTT GGAACGGTGA CAACATGATT 600 

GAAGCTACCA CCAACGCTCC ATGGTACAAG GGTTGGGAAA AGGAAACCAA 650 

GGCCGGTGTC GTCAAGGGTA AGACTTTGTT GGAAGCCATT GACGCCATTG 700 

15 AACAACCATC TAGACCAACT GACAAGCCAT TGAGATTGCC ATTGCAAGAT 750 

GTTTACAAGA TTGGTGGTAT TGGTACTGTG CCAGTCGGTA GAGTTGAAAC 800 

CGGTGTCATC AAGCCAGGTA TGGTTGTTAC TTTTGCCCCA GCTGGTGTTA 850 

CCACTGAAGT CAAGTCCGTT GAAATGCATC ACGAACAATT GGAACAAGGT 900 

GTTCCAGGTG ACAACGTTGG TTTCAACGTC AAGAACGTTT CCGTTAAGGA 950 

20 AATCAGAAGA GGTAACGTCT GTGGTGACGC TAAGAACGAT CCACCAAAGG 1000 

GTTGCGCTTC TTTCAACGCT ACCGTCATTG TTTTGAACCA TCCAGGTCAA 1050 

ATCTCTGCTG GTTACTCTCC AGTTTTGGAT TGTCACACTG CTCACATTGC 1100 

TTGTAGATTC GACGAATTGT TGGAAAAGAA CGACAGAAGA TCTGGTAAGA 1150 

AGTTGGAAGA CCATCCAAAG TTCTTGAAGT CCGGTGACGC TGCTTTGGTC 1200 

25 AAGTTCGTTC CATCTAAGCC AATGTGTGTT GAAGCTTTCA GTGAATACCC 1250 

ACCATTAGGT AGATTCGCTG TC AG AG AC AT GAGACAAACT GTCGCTGTCG 1300 

GTGTTATCAA GTCTGTTGAC AAGACTGAAA AGGGCGCTAA GGTTACCAAG 1350 

GCTGCTCAAA AGGCTGCTAA GAAATAA 1377 



30 



2) INFORMATION FOR SEQ ID NO: 1989 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Eremothecium gossypii 

(B) STRAIN: ATCC 10895 

45 (C) ACCESSION NUMBER: X73978 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1989 

ATGGGTAAGG AAAAGACTCA CGTTAACGTT GTCGTCATCG GTCACGTCGA 50 

50 CTCTGGTAAG TCTACTACCA CCGGTCACTT GATCTACAAG TGTGGTGGTA 100 

TTGACAAGAG AACCATCGAG AAGTTCGAGA AGGAGGCTGC CGAGTTGGGT 150 

AAGGGTTCTT TCAAGTACGC CTGGGTTTTG GACAAATTGA AGGCTGAGAG 200 

AGAGAGAGGT ATCACCATCG ACATTGCGTT GTGGAAGTTC GAGACTCCAA 250 
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AGTACCACGT CACTGTCATT GACGCCCCAG GCCACAGAGA CTTCATCAAG 30 0 

AACATGATTA CCGGTACTTC TCAAGGTGAC TGTGCCATCT TGATCATTGC 350 

TGGTGGTGTC GGTGAGTTCG AGGCTGGTAT CTCCAAGGAC GGTCAGACCA 40 0 

GAGAGCACGC TTTGTTGGCT TACACCTTGG GTGTCAAGCA GTTGATCGTT 450 

5 GCCATCAACA AGATGGACTC CGTCAAGTGG GACGAGTCCA GATACCAGGA 500 

GATTGTCAAG GAGACCTCCA ACTTCATCAA GAAGGTCGGT TACAACCCTA 550 

AGACTGTTCC ATTCGTTCCA ATCTCCGGCT GGAACGGTGA CAACATGATT 60 0 

GAGGCCACCA CCAACGCCCC ATGGTACAAG GGCTGGGAGA AGGAGACCAA 650 

GGCTGGTGCC GTCAAGGGTA AGACCTTGTT GGAGGCCATT GACGCCATTG 700 

LO AGCCACCTGT CAGACCAACT GACAAGGCAT TGAGATTGCC ATTGCAGGAT 750 

GTCTACAAGA TCGGTGGTAT TGGTACGGTT CCAGTCGGCA GAGTCGAGAC 800 

CGGTGTCATC AAGCCAGGTA TGGTTGTTAC CTTCGCCCCA TCCGGTGTCA 850 

CCACTGAAGT CAAGTCCGTC GAGATGCACC ACGAGCAATT GGAGGAGGGT 900 

GTCCCAGGTG ACAACGTTGG TTTCAACGTC AAGAACGTCT CCGTCAAGGA 950 

L5 GATCAGAAGA GGTAACGTTT GCGGTGACTC CAAGAACGAC CCACCAAAGG 1000 

CTGCTGAGTC CTTCAACGCT ACCGTCATTG TCTTGAACCA CCCAGGTCAA 1050 

ATCTCTGCCG GTTACTCTCC AGTCTTGGAC TGTCACACTG CCCACATTGC 1100 

TTGTAAGTTC GACGAGTTGT TGGAGAAGAA CGACAGAAGA ACCGGTAAGA 1150 

AGTTGGAAGA CTCTCCAAAG TTCCTAAAGG CCGGTGACGC TGCCATGGTC 1200 

20 AAGTTTGTCC CATCCAAGCC AATGTGTGTT GAGGCTTTCA CCGACTACCC 1250 

ACCATTGGGT AGATTCGCTG TCAGAGACAT GAGACAGACC GTTGCTGTCG 1300 

GTGTCATCAA GTCTGTTGTC AAGTCCGACA AGGCTGGTAA GGTCACCAAG 1350 

GCCGCCCAAA AGGCTGGTAA GAAATAG 1377 



25 

2) INFORMATION FOR SEQ ID NO: 1990 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 1377 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



35 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Eremothecium gossypii 
(C) ACCESSION NUMBER: A29820 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1990 



ATGGGTAAGG AAAAGACTCA CGTTAACGTT 
CTCTGGTAAG TCTACTACCA CCGGTCACTT 
TTGACAAGAG AACCATCGAG AAGTTCGAGA 
AAGGGTTCTT TCAAGTACGC CTGGGTTTTG 
AGAGAGAGGT ATCACCATCG ACATTGCGTT 
AGTACCACGT CACTGTCATT GACCCCCCAG 
AACATGATTA CCGGTACTTC TCAAGCTGAC 
TGGTGGTGTC GGTGAGTTCG AGGCTGGTAT 
GAGAGCACGC TTTGTTGGCT TACACCTTGG 
GCCATCAACA AGATGGACTC CGTCAAGTGG 
GATTGTCAAG GAGACCTCCA ACTTCATCAA 



GTCGTCATCG GTCACGTCGA 50 

GATCTACAAG TGTGGTGGTA 100 

AGGAGGCTGC CGAGTTGGGT 150 

GACAAATTGA AGGCTGAGAG 200 

GTGGAAGTTC GAGACTCCAA 250 

GCCACAGAGA CTTCATCAAG 300 

TGTGCCATCT TGATCATTGC 350 

CTCCAAGGAC GGTCAGACCA 400 

GTGTCAAGCA GTTGATCGTT 450 

GACGAGTCCA GATACCAGGA 500 

GAAGGTCGGT TACAACCCTA 550 
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AGACTGTTCC ATTCGTTCCA ATCTCCGGCT GGAACGGTGA CAACATGATT 600 

GAGGCCACCA CCAACGCCCC ATGGTACAAG GGCTGGGAGA AGGAGACCAA 650 

GGCTGGTGCC GTCAAGGGTA AGACCTTGTT GGAGGCCATT GACGCCATTG 700 

AGCCACCTGT CAGACCAACT GACAAGGCAT TGAGATTGCC ATTGCAGGAT 750 

5 GTCTACAAGA TCGGTGGTAT TGGTACGGTT CCAGTCGGCA GAGTCGAGAC 800 

CGGTGTCATC AAGCCAGGTA TGGTTGTTAC CTTCGCCCCA TCCGGTGTCA 850 

CCACTGAAGT CAAGTCCGTC GAGATGCACC ACGAGCAATT GGAGGAGGGT 900 

GTCCCAGGTG ACAACGTTGG TTTCAACGTC AAGAACGTCT CCGTCAAGGA 950 

GATCAGAAGA GGTAACGTTT GCGGTGACTC CAAGAACGAC CCACCAAAGG 1000 

10 CTGCTGAGTC CTTCAACGCT ACCGTCATTG TCTTGAACCA CCCAGGTCAA 1050 

ATCTCTGCCG GTTACTCTCC AGTCTTGGAC TGTCACACTG CCCACATTGC 1100 

TTGTAAGTTC GACGAGTTGT TGGAGAAGAA CGACAGAAGA ACCGGTAAGA 1150 

AGTTGGAAGA CTCTCCAAAG TTCCTAAAGG CCGGTGACGC TGCCATGGTC 1200 

AAGTTTGTCC CATCCAAGCC AATGTGTGTT GAGGCTTTCA CCGACTACCC 1250 

L5 ACCATTGGGT AGATTCGCTG TCAGAGACAT GAGACAGACC GTTGCTGTCG 13 00 

GTGTCATCAA GTCTGTTGTC AAGTCCGACA AGGCTGGTAA GGTCACCAAG 1350 

GCCGCCCAAA AGGCTGGTAA GAAATAG 1377 



20 



30 



35 



2) INFORMATION FOR SEQ ID NO: 1991 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1646 bases 
25 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Aspergillus oryzae 

(B) STRAIN: KBN616 

(C) ACCESSION NUMBER: AB007770 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1991 



TGGGGTAAGT TTATCAACCC GTCGAGTTGT GTTGCATCTC AGATCATGGC 50 

TGACAAGTAC TTTCCTCCTT ACAGTAAGGA AGACAAGCAG CACATCAACA 100 

£0 TCGTCGTTAT CGGCCACGTC GATTCCGGCA AGTCCACCAC CACTGGTCAC 150 

TTGATCTACA AGTGTGGTGG TATCGACCAG CGTACCATCG AGAAGTTCGA 200 

GAAGGAAGCC GCTGAGCTCG GTAAGGGTTC CTTCAAGTAC GCCTGGGTTC 250 

TTGACAAGCT CAAGTCCGAG CGTGAGCGTG GTATCACCAT CGATATCGCC 300 

CTCTGGAAGT TCCAGACCTC CAAGTATGAG GTCACCGTCA TTGGTAAGCA 350 

45 TTTGAGTTCC AACCTACGTT GCCCAACATT TACAGTCATC TAACAAAGTT 400 

CAATAGATGC CCCCGGTCAC CGTGACTTCA TCAAGAACAT GATCACTGGT 450 

ACTTCCCAGG CTGACTGCGC TATCCTCATC ATTGCCTCCG GTACTGGTGA 500 

ATTCGAGGCT GGTATCTCCA AGGATGGTCA GACCCGTGAG CACGCTCTGC 550 

TCGCTTTCAC CCTCGGTGTC CGTCAGCTCA TCGTTGCCCT CAACAAGATG 600 

50 GACACCTGCA AGTGGTCTCA GGATCGTTAC AACGAAATCG TTAAGGAGAC 650 

TTCCAACTTC ATCAAGAAGG TCGGATACAA CCCCAAGAGC GTTCCTTTCG 700 

TCCCCATCTC CGGTTTCAAC GGTGACAACA TGATTGAGGC CTCCACCAAC 750 

TGCCCCTGGT ACAAGGGCTG GGAGAAGGAG ACCAAGGCTG GCAAGTCCAC 800 
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CGGTAAGACC CTTCTCGAGG CCATCGATGC CATCGAGCCC CCCGTCCGTC 850 

CCACCGACAA GCCTCTCCGT CTTCCCCTCC AGGATGTCTA CAAGATCTCT 900 

GGTATCGGTA CTGTGCCCGT CGGTCGTGTC GAGACTGGTG TCATCAAGCC 950 

TGGTATGGTC GTTACTTTCG CTCCTGCCAA CGTGACCACT GAAGTCAAGT 1000 

CCGTTGAAAT GCACCACCAG CAGCTCCAGG CCGGTAACCC CGGTGACAAC 1050 

GTTGGTTTCA ACGTCAAGAA CGTCTCCGTC AAGGAAGTCC GCCGTGGTAA 1100 

CGTTGCCGGT GACTCCAAGA ACGACCCCCC TGCTGGCTGC GATTCCTTCA 1150 

ACGCCCAGGT CATCGTCCTT AACCACCCCG GTCAGGTCGG CAACGGTTAC 1200 

GCTCCCGTCC TGGACTGCCA CACCGCTCAC ATTGCTTGCA AGTTCGCTGA 1250 

GCTCCTTGAG AAGATTGACC GCCGTACCGG TAAATCTGTT GAGGACAAGC 1300 

CCAAGTTCAT CAAGTCTGGT GATGCTGCCA TCGTCAAGAT GATTCCCTCC 1350 

AAGCCCATGT GTGTGGAGTC TTTCACTGAC TTCCCCCCTC TTGGTCGTTT 14 00 

CGCTGTCCGT GACGTAAGTT TTTCCCTCTT GACTATCTTC ACAATTTTTC 1450 

ACATATTTTC ACGCCTCGTC CCACTCTTTT TCCTCCCTTC CTCTTTGGTT 1500 

CCCCTTTTTG CCTGCAAGTT CTCTATAGCT AACATGATGT CTAGATGCGT 1550 

CAAACTGTTG CCGTCGGAGT TATCAAGTCG GTTGAGAAGA ACACTGGCGG 1600 

TTCTGGCAAG GTCACCAAGG CCGCCCAGAA GGCTGGCAAG AAATAA 1646 



2) INFORMATION FOR SEQ ID NO: 1992 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1380 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

. (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aureobasidium pullulans 

(B) STRAIN: R106 

(C) ACCESSION NUMBER: U19723 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1992 

ATGGGTAAGG AAAAGTCCCA CATCAACGTC GTCGTTATCG GCCACGTCGA 50 

CTCCGGTAAG TCGACCACCA CCGGTCACTT GATCTACAAG TGCGGTGGTA 100 

TCGACAAGCG TACCATCGAG AAGTTCGAGA AGGAAGCCGC CGAACTCGGC 150 

AAGGGTTCCT TCAAGTACGC CTGGGTCCTC GACAAGCTGA AGTCTGAGCG 200 

TGAGCGTGGT ATCACTATCG ATATCGCTCT GTGGAAGTTC GAGACCCCCA 250 

AGTACATGGT CACCGTCATC GATGCCCCCG GTCACCGTGA TTTCATCAAG 300 

AACATGATCA CTGGTACCTC CCAGGCTGAC TGCGCCATTC TCATCATTGC 350 

CGCCGGTACT GGTGAGTTCG AGGCTGGTAT CTCCAAGGAT GGCCAGACTC 400 

GTGAGCACGC CCTTCTCGCC TACACCCTTG GTGTCAAGCA GCTCATCGTC 4 50 

GCTATCAACA AGATGGACAC CACCAAGTGG TCTGAGGCCC GTTACCAGGA 500 

GATCATCAAG GAGACCTCCG GTTTCATCAA GAAGGTCGGC TACAACCCCA 550 

AGCACGTTCC CTTTGTCCCC ATCTCGGGTT TCAACGGTGA CAACATGATT 600 

GAGGTTTCTT CCAACTGCCC CTGGTACAAG GGTTGGGAGA AGGAGACCAA 650 

GGCCAAGGCC ACTGGTAAGA CTCTCCTCGA GGCCATTGAC GCCATCGACC 700 

CTCCTTCGCG CCCCACCGAC AAGCCCCTCC GTCTTCCCCT CCAGGATGTC 750 

TACAAGATCG GTGGTATTGG CACGGTGCCC GTCGGCCGTG TCGAGACCGG 800 
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TACCATCAAG GGTGGTATGG 
CTGAGGTCAA GTCCGTCGAG 
CCCGGTGACA ACGTCGGCTT 
CCGTCGTGGT AACGTTGCCG 
5 GCGACTCCTT CAACGCCCAG 
GGTGCTGGTT ACGCACCCGT 
CAAGTTCTCC GAGCTTGTTG 
TTGAGGCCGC CCCCAAGTTC 
ATGGTTCCCT CCAAGCCTAT 
10 TCTCGGTCGT TTCGCCGTCC 
TCATCAAGTC CGTCGCCAAG 
AAGGCCGCTG TCAAGGCTGG 



TCGTCACCTT CGCCCCCGCT 
ATGCACCACG AGCAGCTCTC 
CAACGTCAAG AACGTCTCCG 
GTGACTCCAA GAACGACCCC 
GTCATCGTCC TCAACCACCC 
CCTCGATTGC CACACTGCCC 
AGAAGATTGA CCGCCGTACC 
ATCAAGTCTG GTGACGCCGC 
GTGTGTTGAG GCCTTCACTG 
GTGACATGAG ACAGACCGTC 
TCCGACAAGC AGGGTGCCGG 
CAAGAAGTAA 



PCT/CA00/01150 

GGTGTCACCA 850 

CGAGGGTCTC 900 

TCAAGGAGAT 950 

CCCAAGGGTT 1000 

CGGTCAGGTC 1050 

ACATCGCCTG 1100 

GGCAAGTCCG 1150 

CATCGTCAAG 1200 

ACTACCCTCC 1250 

GCTGTCGGTG 1300 

TAAGGTTACC 1350 

1380 



15 



25 



30 



2) INFORMATION FOR SEQ ID NO: 1993 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1383 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulatum 

<B) STRAIN: 186AS 

(C) ACCESSION NUMBER: U14100 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1993 



ATGGGTAAGG AAGACAAGAC TCACATTAAC CTCGTCGTCA TCGGCCACGT 50 

CGATTCGGGC AAATCTACCA CCACTGGTCA TTTGATCTAC AAATGCGGTG 100 

35 GTATTGACAG CCGTACCATT GAGAAGTTCG AAAAGGAAGC CGAAGAGTTG 150 

GGCAAGAAAT CCTTCAAATA TGCGTGGGTC CTTGACAAAC TGAAGTCTGA 200 

GCGTGAGCGT GGTATCACCA TCGATATTGC CCTCTGGAAA TTCGAGACTC 250 

CGAAGTACAG TGTCACTGTC ATTGATGCTC CCGGCCATCG TGACTTCATC 300 

AAGAACATGA TCACTGGTAC CTCCCAGGCT GACTGCGCTA TCCTCATCAT 350 

40 TGCTGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 400 

CTCGTGAGCA CGCTCTGCTT GCTTTCACCC TTGGTGTGAG GCAACTCATC 450 

GTTGCCATCA ACAAGATGGA CACCACCAAG TGGTCCGAGT CCCGTTTCAA 500 

CGAAATCATC AAGGAGGTTT CCAACTTCAT CAAGAAGGTC GGATATAACC 550 

CCAAGGCTGT TCCCTTCGTG CCAATCTCTG GTTTCGAGGG TGACAACATG 600 

45 ATTGAACCCT CCCCCAACTG CACATGGTAC AAGGGCTGGA ACAAGGAGAC 650 

TGCCTCTGGC AAGTCTTCTG GTAAAACCCT TCTCGATGCC ATTGACGCCA 700 

TTGAACCCCC AACCCGTCCT ACCGATAAGC CCCTCCGTCT TCCCCTCCAG 750 

GATGTGTACA AAATCTCTGG TATTGGCACT GTTCCCGTCG GACGTGTTGA 800 

GACTGGTGTC ATCAAGCCCG GTATGGTCGT GACTTTCGCT CCCTCCAACG 850 

50 TCACCACTGA AGTCAAGTCC GTCGAAATGC ACCACCAACA ACTCCAGGCT 900 

GGTTACCCTG GCGACAACGT CGGCTTCAAC GTCAAGAACG TTTCAGTCAA 950 

GGAAGTCCGC CGTGGCAACG TTGCTGGCGA CTCCAAAAAT GATCCCCCTA 1000 

AGGGCTGCGA ATCCTTCAAT GCCCAGGTCA TCGTCCTTAA CCACCCCGGC 1050 
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CAGGTTGGCG CTGGTTATGC CCCAGTCCTC 

TGCTTGCAAG TTCTCTGAGC TTATTGAGAA 

AGTCTGTTGA GAACAACCCC AAGTTCATCA 

GTCAAGATGG TTCCCTCCAA GCCCATGTGC 

5 TCCCCCTCTT GGACGTTTCG CTGTCCGTGA 

TCGGTGTCAT CAAGTCCGTC ATCAAGTCTG 

ACCAAGGCCG CGCAGAAGGC CACCAAGAAA 



GACTGCCACA CTGCCCACAT 1100 

GATCGACCGC CGTACCGGAA 1150 

AGTCTGGTGA TGCTGCTATC 1200 

GTGGAGCCCT TCACTGACTA 1250 

CAT GAG AC AA ACCGTCGCTG 1300 

ACAAGACTGC TGGCAAGGTC 1350 

TAA 1383 



LO 

2) INFORMATION FOR SEQ ID NO: 1994 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1383 bases 
L5 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

20 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neurospora crassa 
(C) ACCESSION NUMBER: D45837 



25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1994 

ATGGGCAAGG AGGACAAGAC TCACAT.CAAC GTCGTCGTTA TCGGCCACGT 50 

CGATTCCGGC AAGTCTACCA CTACCGGTCA CTTGATCTAC AAGTGCGGTG 100 

GTATCGACAA GCGTACCATC GAGAAGTTCG AGAAGGAAGC CGCTGAGCTC 150 

30 GGTAAGGGTT CCTTCAAGTA TGCCTGGGTT CTTGACAAGC TCAAGGCCGA 200 

GCGTGAGCGT GGTATCACCA TCGATATCGC CCTCTGGAAG TTCGAGACTC 250 

CCAAGTACTA CGTCACCGTC ATCGATGCCC CCGGTCATCG TGATTTCATC 300 

AAGAACATGA TCACTGGTAC CTCCCAGGCT GATTGCGCTA TCCTCATCAT 350 

TGCCGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 400 

35 CCCGTGAGCA CGCCCTGCTC GCCTACACCC TCGGTGTCAA GCAGCTCATT 4 50 

GTTGCCATCA ACAAGATGGA CACCACCCAG TGGTCCCAGA CTCGTTTCGA 500 

GGAGATCATC AAGGAGACCA AGAACTTCAT CAAGAAGGTT GGCTACAACC 550 

CCGCTGGTGT CGCTTTCGTC CCCATCTCCG GCTTCAACGG CGACAACATG 600 

CTTGAGCCCT CCACCAACTG CCCCTGGTAC AAGGGTTGGG AGAAGGAGAC 650 

10 CAAGGCCGGC AAGGCCACTG GCAAGACCCT CCTCGAGGCC ATCGACGCCA 7 00 

TTGAGCCCCC CAAGCGTCCT ACCGACAAGC CCCTCCGTCT TCCCCTCCAG 750 

GATGTCTACA AGATCGGTGG TATCGGCACA GTGCCCGTCG GCCGTATCGA 800 

GACTGGTGTC CTCAAGCCCG GTATGGTCGT TACCTTCGCT CCTTCCAACG 850 

TCACCACTGA AGTCAAGTCC GTCGAGATGC ACCACGAGCA GCTTGCTCAG 900 

15 GGTGTCCCCG GTGACAACGT CGGCTTCAAC GTGAAGAACG TTTCCGTCAA 950 

GGATATCCGC CGTGGTAACG TTGCCGGTGA CTCCAAGAAC GACCCCCCTG 1000 

CTGGCGCCGC CTCTTTCACC GCCCAGGTCA TCGTTCTCAA CCACCCCGGT 1050 

CAGGTCGGTG CCCGCTACGC CCCCGTCCTC GACTGCCACA CTGCCCACAT 1100 

TGCCTGCAAG TTCGCCGAGC TCCTCGAGAA GATCGACCGC CGTACTGGTA 1150 

50 AGGCTGTTGA GGCCTCCCCC AAGTTCATCA AGTCTGGTGA TGCTGCCATC 1200 

GTCAAGATGA TTCCCTCCAA GCCCATGTGC GTTGAGGCTT TCACCGACTA 1250 

CCCTCCCCTC GGCCGTTTCG CCGTCCGTGA CATGCGTCAG ACCGTCGCCG 1300 

TCGGTGTCAT CAAGGCCGTC GACAAGTCCA CCGCTGCCGC TGGCAAGGTC 1350 
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ACCAAGTCCG CTGCCAAGGC CGCCAAGAAG TAA T383 



5 2) INFORMATION FOR SEQ ID NO: 1995 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 bases 

(B) TYPE: Nucleic acid 
L0 (C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

L5 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Podospora anserina 

(C) ACCESSION NUMBER: X74799 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1995 



ATGGGCAAGG AGGACAAGAC TCACATCAAC GTCGTCGTTA TCGGCCACGT 50 

CGATTCCGGC AAGTCGACCA CCACTGGTCA CTTGATCTAC AAGTGCGGTG 100 

GTATTGACAA GCGTACCATC GAGAAGTTCG AGAAGGAAGC TGCTGAGCTC 150 

GGCAAGGGCT CTTTCAAGTA TGCCTGGGTT CTTGACAAGT TGAAGGCCGA 200 

25 GCGTGAGCGT GGTATCACCA TCGATATTGC CCTCTGGAAG TTCGAGACCC 250 

CCAAGTACTA TGTCACCGTC ATTGATGCCC CCGGCCATCG TGATTTCATC 300 

AAGAACATGA TTACTGGTAC TTCCCAGGCC GATTGCGCCA TTCTCATCAT 350 

TGCCGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 400 

CCGGTGAGCA CGCTCTCCTC GCCTACACCC TCGGTGTGAA GCAGCTCATC 450 

30 GTCGCCATCA ACAAGATGGA CACCACCAAG TGGTCCGAGG CCCGCTTCAA 500 

CGAGATCATC AAGGAGACCT CCAACTTCAT CAAGAAGGTC GGCTACAACC 550 

CCAAGACTGT TGCCTTCGTC CCCATCTCCG GTTTCAACGG CGACAACATG 600 

CTTGAGGCTT CCACCAACTG CCCCTGGTAC AAGGGCTGGG AGAAGGAGGT 650 

CAAGGGTGGC AAGGCCACCG GCAAGACCCT CCTTGAGGCC ATCGACTCCA 700 

35 TCGAGCCCCC CAAGCGTCCC ACCGACAAGC CCCTCCGTCT TCCCCTCCAG 750 

GATGTCTACA AGATCGGCGG TATCGGCACA GTCCCTGTCG GCCGTATCGA 800 

GACTGGTATC CTCAAGCCCG GTATGGTCGT TACCTTCGCT CCTTCCAACG 850 

TCACCACTGA AGTCAAGTCC GTCGAGATGC ACCACGAGCA GCTCGCTGAG 900 

GGTGTTCCCG GTGACAACGT TGGTTTCAAC GTGAAGAACG TCTCCGTCAA 950 

40 GGAAATCCGC CGTGGCAACG TTGCCGGTGA CTCCAAGAAC GACCCCCCCA 1000 

TGGGCGCCGC CTCTTTCGAT GCCCAGGTCA TCGTCCTCAA CCACCCCGGC 1050 

CAGGTCGGTG CTGGTTACGC CCCCGTCCTC GATTGCCACA CTGCCCACAT 1100 

CGCCTGCAAG TTCTCTGAGC TCCTGCAGAA GATCGACCGC CGTACTGGTA 1150 

AGGCCGTTGA GGAGAGCCCC AAGTTCATCA AGTCTGGTGA TGCTGCCATC 1200 

45 GTCAAGATGG TTCCCTCCAA GCCCATGTGC GTTGAGGCTT TCACTGAGTA 1250 

CCCTCCCCTC GGTCGTTTCG CCGTCCGTGA CATGCGTCAG ACCGTCGCTG 1300 

TCGGTGTCAT CAAGAAGGTC GAGAAGGCCG CTGCTGGTTC CGGCAAGGTT 1350 

ACCAAGTCCG CTGCCAAGGC TGGCAAGAAA TAA 1383 



2) INFORMATION FOR SEQ ID NO: 1996 



1038 

SUBSTITUTE SHEET {RULE 26) 
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PCT/CA00/01150 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 6 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Podospora curvicolla 

(B) STRAIN: VLV 

(C) ACCESSION NUMBER: X96614 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1996 

ATGGGCAAGG AGGACAAGAC TCACATCAAC GTCGTCGTTA TCGGCCACGT 50 

CGATTCCGGC AAGTCGACCA CCACTGGTCA CTTGATCTAC AAGTGCGGTG 100 

GTATTGACAA GCGTACCATC GAGAAGTTCG AGAAGGAAGC TGCTGAGCTC 150 

GGCAAGGGCT CTTTCAAGTA TGCCTGGGTT CTTGACAAGT TGAAGGCCGA 200 

GCGTGAGCGT GGTATCACCA TTGATATCGC CCTCTGGAAG TTCGAGACCC 250 

CCAAGTACTA TGTCACCGTC ATCGATGCCC CCGGCCATCG TGATTTCATC 300 

AAGAACATGA TTACTGGTAC TTCCCAGGCC GATTGCGCCA TTCTCATCAT 350 

TGCCGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 400 

CCCGTGAGCA CGCTCTCCTC GCCTACACCC TCGGTGTGAA GCAGCTCATC 450 

GTCGCCATCA ACAAGATGGA CACCACCAAA TGGTCCGAGG CCCGCTTCAA 500 

CGAGATCATC AAGGAGACCT CCAACTTCAT CAAGAAGGTC GGCTACAACC 550 

CCAAGACTGT TGCCTTCGTC CCCATCTCCG GTTTCAACGG CGACAACATG 600 

CTTGAGGCTT CCACCAACTG CCCCTGGTAC AAGGGTTGGG AGAAGGAGGT 650 

CAAGGGTGGC AAGGCTACTG GCAAGACCCT CCTCGAGGCC ATCGACTCCA 700 

TCGAGCCCCC CAAGCGTCCC ACCGACAAGC CCCTCCGTCT TCCCCTTCAG 750 

GACGTTTACA AGATCGGCGG TATCGGCACA GTCCCTGTCG GCCGTATCGA 800 

GACTGGTATC CTCAAGCCCG GTATGGTCGT TACCTTCGCC CCTTCCAACG 850 

TCACCACTGA AGTCAAGTCC GTCGAGATGC ACCACGAGCA GCTCTCTGAG 900 

GGTGTCCCCG GTGACAACGT TGGTTTCAAC GTGAAGAACG TCTCCGTCAA 950 

GGAAATCCGC CGTGGCAACG TTGCCGGTGA CTCCAAGAAC GACCCCCCTC 1000 

TTGGCGCCGC TTCTTTCGAT GCCCAGGTCA TCGTCCTCAA CCACCCCGGC 1050 

CAGGTCGGTG CTGGTTACGC CCCCGTCCTC GATTGCCACA CTGCCCACAT 1100 

CGCCTGCAAG TTCGCTGAGC TCCTGCAGAA GATCGATCGC CGTACTGGTA 1150 

AGGCTGTTGA GGAGAGCCCT AAGTTCATCA AGTCTGGTGA TGCTGCCATC 1200 

GTCAAGATGA TTCCCTCCAA GCCCATGTGC GTTGAGGCTT TCACTGAGTA 1250 

CCCTCCCCTC GGTCGTTTCG CTGTCCGTGA CATGCGTCAG ACCGTCGCTG 1300 

TCGGTGTCAT CAAGAAGGTC GAGAAGGCCG CTGCTGGTTC CGGCAAGGTC 1350 

ACCAAGTCCG CTGCCAAGGC TGGTGGCAAG AAATAA 1386 



2) INFORMATION FOR SEQ ID NO: 1997 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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LO 



10 



<ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Sordaria macrospora 

(B) STRAIN: OOO 

(C) ACCESSION NUMBER: X96615 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1997 



ATGGGTAAGG AAGACAAGGC TCACATCAAC GTCGTCGTTA TCGGCCACGT 50 

CGATTCCGGC AAGTCCACCA CTACCGGTCA CCTGATCTAC AAGTGCGGTG 100 

GTATCGACAA GCGTACCATC GAGAAGTTCG AGAAGGAAGC CGCTGAGCTC 150 

GGCAAGGGTT CCTTCAAGTA TGCCTGGGTT CTTGACAAGC TCAAGGCCGA 200 

L5 GCGTGAGCGT GGTATCACCA TCGATATCGC CCTCTGGAAG TTCGAGACTC 250 

CCAAGTACTA CGTCACCGTC ATCGATGCCC CCGGCCATCG TGATTTCATC 300 

AAGAACATGA TCACTGGTAC CTCCCAGGCT GATTGCGCTA TTCTCATCAT 350 

TGCCGCTGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 4 00 

CTCGTGAGCA. CGCTCTTCTC GCCTACACCC TCGGTGTCAA GCAGCTCATC 450 

10 GTTGCCATCA ACAAGATGGA CACCACCCAG TGGTCCCAGG CTCGTTTCGA 500 

GGAGATCATC AAGGAGACCA AGAACTTCAT CAAGAAGGTC GGCTACAACC 550 

CCGCCACCGT CGCTTTCGTC CCCATCTCCG GCTTCAACGG CGACAACATG 600 

CTTGAGGCCT CCACCAACTG CCCCTGGTAG AAGGGTTGGG AGAAGGAGAC 650 

CAAGGCCGGC AAGTCCACTG GCAAGACCCT CCTCGAGGCC ATCGACGCCA 700 

15 TTGAGCAGCC CAAGCGCCCG ACCGACAAGC CCCTCCGTCT TCCCCTCCAG 750 

GATGTCTACA AGATCGGCGG TATCGGCACA GTGCCCGTCG GCCGTATCGA 800 

GACTGGTGTC CTCAAGCCCG GTATGGTCGT TACCTTCGCT CCTTCCAACG 850 

TCACCACTGA AGTCAAGTCC GTCGAGATGC ACCACGAGCA GCTTGCTCAG 900 

GGTGTTCCCG GTGACAACGT CGGCTTCAAC GTGAAGAACG TTTCCGTCAA ; 950 

30 GGATATCCGT CGTGGTAACG TTGCCGGTGA CTCCAAGAAC GACCCCCCTG 1000 

TCGGCGCTGC CTCTTTCACC GCCCAGGTCA TCGTCCTTAA CCACCCCGGT 1050 

CAGGTCGGTG CCGGCTACGC TCCCGTCCTC GATTGCCACA CTGCCCACAT 1100 

TGCCTGCAAG TTCGCCGAGC TCCTCGAGAA GATCGATCGC CGTACTGGTA 1150 

AGGCTGTTGA GACTTCTCCC AAGTTCATCA AGTCTGGTGA TGCTGCCATC 1200 

35 GTCAAGATGA TTCCCTCCAA GCCCATGTGC GTCGAGGCTT TCACCGACTA 1250 

CCCTCCCCTC GGTCGTTTCG CCGTCCGTGA CATGCGTCAG ACCGTCGCTG 1300 

TCGGTGTCAT CAAGGCCGTC GACAAGACCC AGGCTGTCGC TGGCAAGGTC 1350 

ACCAAGTCTG CTGCCAAGGC TGCCAAGAAG TAA 1383 



2) INFORMATION FOR SEQ ID NO: 1998 



(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 1383 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

50 (ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM; Trichoderma reesei 
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(B) STRAIN: QM9414 

(C) ACCESSION NUMBER: Z23012 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1998 

5 

ATGGGTAAGG AGGACAAGAC TCACATCAAC GTGGTCGTCA TCGGCCACGT 50 

CGACTCCGGC AAGTCTACCA CCACTGGTCA CTTGATCTAC CAGTGCGGTG 100 

GTATCGACAA GCGTACCATT GAGAAGTTCG AGAAGGAAGC CGCCGAACTC 150 

GGCAAGGGTT CCTTCAAGTA CGCGTGGGTT CTTGACAAGC TCAAGGCCGA 200 

10 GCGTGAGCGT GGTATCACCA TCGACATTGC CCTCTGGAAG TTCGAGACTC 250 

CCAAGTACTA TGTCACCGTC ATTGACGCTC CCGGCCACCG TGACTTCATC 300 

AAGAACATGA TCACTGGTAC TTCCCAGGCC GACTGCGCTA TCCTCATCAT 350 

CGCTGCCGGT ACTGGTGAGT TCGAGGCTGG TATCTCCAAG GATGGCCAGA 400 

CCCGTGAGCA CGCTCTGCTC GCCTACACCC TGGGTGTCAA GCAGCTCATC 450 

15 GTCGCCATCA ACAAGATGGA CACTGCCAAC TGGGCCGAGG CTCGTTACCA 500 

GGAAATCATC AAGGAGACTT CCAACTTCAT CAAGAAGGTC GGCTTCAACC 550 

CCAAGGCCGT TGCTTTCGTC CCCATCTCCG GCTTCAACGG TGACAACATG 600 

CTCACCCCCT CCACCAACTG CCCCTGGTAC AAGGGCTGGG AGAAGGAGAC 650 

CAAGGCTGGC AAGTTCACCG GCAAGACCCT CCTTGAGGCC ATCGACTCCA 700 

20 TCGAGCCCCC CAAGCGTCCC ACGGACAAGC CCCTGCGTCT TCCCCTCCAG 750 

GACGTCTACA AGATCGGTGG TATCGGAACA GTTCCCGTCG GCCGTATCGA 800 

GACTGGTGTC CTCAAGCCCG GTATGGTCGT TACCTTCGCT CCCTCCAACG 850 

TCACCACTGA AGTCAAGTCC GTCGAGATGC ACCACGAGCA GCTCGCTGAG 900 

GGCCAGCCTG GTGACAACGT TGGTTTCAAC GTGAAGAACG TTTCCGTCAA 950 

25 GGAAATCCGC CGTGGCAACG TTGCCGGTGA CTCCAAGAAC GACCCCCCCA 1000 

TGGGCGCCGC TTCTTTCACC GCCCAGGTCA TCGTCATGAA CCACCCCGGC 1050 

CAGGTCGGTG CCGGCTACGC CCCCGTCCTC GACTGCCACA CTGCCCACAT 1100 

TGCCTGCAAG TTCGCCGAGC TCCTCGAGAA GATCGACCGC CGTACCGGTA 1150 

AGGCTACCGA GTCTGCCCCC AAGTTCATCA AGTCTGGTGA CTCCGCCATC 1200 

30 GTCAAGATGA TCCCCTCCAA GCCCATGTGC GTTGAGGCTT TCACCGACTA 1250 

CCCTCCCCTG GGTCGTTTCG CCGTCCGTGA CATGCGCCAG ACCGTCGCTG 1300 

TCGGTGTCAT CAAGGCCGTC GAGAAGTCCT CTGCCGCCGC CGCCAAGGTC 1350 

ACCAAGTCCG CTGCCAAGGC CGCCAAGAAA TAA 1383 



35 



2) INFORMATION FOR SEQ ID NO: 1999 



(i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 29 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

45 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1999 
CATGTCAAYA TTGGTACTAT TGGTCATGT 29 

50 



2) INFORMATION FOR SEQ ID NO: 2000 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

5 (C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2000 

CCACCYTCIC TCAMGTTGAA RCGTT 25 



15 



25 



30 



2) INFORMATION FOR SEQ ID NO: 2001 



(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 bases 
20 (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2001 
ACYACITTRA CIGCYGCYAT YAC 23 



2) INFORMATION FOR SEQ ID NO: 2002 



(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

40 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2002 
CCIGARGARA GAGCIMGWGG T 21 

45 



2) INFORMATION FOR SEQ ID NO: 2003 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2003 
CATYTCRAIR TTGTCACCTG G 
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2) INFORMATION FOR SEQ ID NO: 2004 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1360 bases 

(B) TYPE: Nucleic acid 
<C) STRANDEDNESS: Double 
(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Candida albicans 

(B) STRAIN: SC5314 

(C) ACCESSION NUMBER: Genome project 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2004 

GCTGCCTTCG ACCGTTCTAA ACCTCATGTC AACATTGGTA CTATTGGTCA 50 

TGTTGATCAT GGTAAAACTA CATTGACTGC TGCTATCACC AAAGTTTTAG 100 

CCGAACAAGG TGGTGCCAAC TTCTTGGATT ATGGTTCTAT TGATAGAGCT 150 

CCAGAAGAAA GAGCTAGAGG TATCACTATT TCCACTGCCC ACGTTGAATA 200 

CGAAACCAAG AACAGACACT ATGCCCACGT TGATTGTCCA GGACACGCTG 250 

ATTATATCAA AAATATGATT ACTGGTGCCG CTCAAATGGA TGGTGCTATC 3 00 

ATTGTTGTTG CTGCCACTGA TGGTCAAATG CCTCAAACCA GAGAACATTT 350 

GTTATTGGCC AGACAAGTTG GTGTTCAAGA CTTGGTTGTG TTTGTCAACA 400 

AAGTCGATAC TATTGATGAC CCTGAAATGT TGGAATTAGT CGAAATGGAA 450 

ATGAGAGAAT TGTTATCCAC CTACGGTTTT GATGGTGACA ACACTCCAGT 500 

TATTATGGGA TCTGCTTTAA- TGGCTTTGGA AGACAAGAAA CCAGAAATTG 550 

GTAAGGAAGC TATCTTGAAA TTGTTAGATG CTGTCGATGA ACACATTCCA 600 

ACTCCATCAA GAGACTTGGA ACAACCATTT TTGTTACCAG TTGAAGACGT 650 

GTTCTCCATC TCCGGTAGAG GAACTGTTGT CACTGGTAGA GTTGAAAGAG 700 

GTGTTTTGAA GAAGGGTGAA GAAATCGAAA TTGTTGGTGG TTTTGACAAA 750 

CCTTACAAGA CTACTGTTAC CGGTATTGAA ATGTTCAAAA AAGAATTAGA 800 

CTCTGCTATG GCTGGTGACA ACTGTGGTGT TTTGTTAAGA GGTGTTAAAA 850 

GAGATGAAAT CAAGAGAGGT ATGGTTTTGG CCAAACCAGG TACTGCTACT 900 

TCTCACAAGA AGTTCTTGGC TTCCTTGTAT ATTTTGACTT CCGAAGAAGG 950 

TGGTCGTTCC ACTCCATTTG GTGAAGGTTA CAAGCCTCAA TGCTTCTTCA 1000 

GAACTAACGA TGTCACTACC ACATTTTCAT TCCCAGAAGG AGAAGGTGTT 1050 

GATCATTCTC AAATGAT CAT GCCAGGTGAC AACATTGAAA TGGTTGGTGA 1100- 

ATTGATCAAA TCTTGTCCAT TAGAAGTCAA CCAACGTTTC AACTTGAGAG 1150 

AAGGTGGTAA AACTGTTGGT ACTGGTTTGA TTACCAGAAT CATCGAATAA 1200 

ACAGAATGTG CACTGTGAAT AATAAAAAGA AAAGAGGTAT ATATAGGTGA 1250 

CTTTGTATTT TGTATTGAAC AATAAAATTC TGTAAATAGT AAGGGCCTCA 13 00 

GAAGTTTTGA TTTGATTTAT GCCATGTGGA CTTGTAGAGA TATCCTTCTC 1350 

AAACTTCTTG 13 60 



2) INFORMATION FOR SEQ ID NO: 2005 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1342 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

1044 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Schizosaccharomyces pombe 
(C) ACCESSION NUMBER: Genome project 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2005 



AAGCCGCATG TCAATATTGG TACTATTGGT 
GACGTTGACG GCTGCTATTA CTAAATGCCT 
GTTTTATGGA TTATAGTCAA ATTGACAAGG 
GGTATTACCA TTTCATCTGC CCATGTTGAA 
CTATGCCCAT GTGGATTGTC CTGGTCACGC 
TTACTGGTGC TGCTACAATG GATGGCGCTA 
GATGGTCAAA TGCCTCAAAC TCGTGAACAT 
CGGTGTAAAG CAAATTGTTG TATACATCAA 
CTGATATGAT CGAGCTTGTC GAAATGGAAA 
TACGGATTTG ATGGTGACAA TACTCCAATT 
TGCCTTAGAG GGTCGTGAGC CTGAGATTGG 
TGATGGAAGC TGTTGATAGT TATATTACTC 
GTCCCTTTCT TGATGGCCAT CGAGGACGTT 
AACTGTAGTC ACTGGCCGTG TCGAGCGCGG 
AAATCGAAAT CGTCGGTTAT GGTAGCCATT 
ATTGAAATGT TCAAAAAGCA GCTTGATGCC 
TGGCCTTTTA CTTCGTTCTA TCAAGCGAGA 
TTGTCGCTCA ACCAGGAACC GTTGCTCCTC 
TTCTATATTT TGACAAAAGA GGAAGGAGGT 
CAAGTATCGT CCCCAACTGT ACAGTCGTAC 
TTACCCACCC TGATCCTAAC GACTCAGACA 
AATGTCGAGA TGATCTGTAC GCTTATTCAC 
ACAACGCTTC ACAGTTCGTG AGGGTGGAAG 
TTACTGAACT TTTGGATTAG TGCATTTATG 
TTTGCATGCT GAATACCAAT ATTATGTCCC 
CAGTGTCATT ATTGTAATAA GACTTTTGCA 
ATACTTTTAT AGTTTCTACT ATTGTTAGCC 



/"» a. r vr^ r T tr T , r* a r*r* 
vJAlGX 1GAUC 


A PPPTA AAA/** 

AGGG 1 aaaac 


50 


rprn/™irp/^f t\ rn /nrpm 

1 1L1 Lj A 1L1 1 


GG 1 G AAGC I A 


100 


\J^\~ U Uj AGG A 


AAA nPP7\ rinm 

AAAGGG AGG I 


150 


1 ACGAAAG 1 G 


G rAATGGTCA 


200 


GGA1 I AGAi I 


A A A A HP TV rp/^i A 

AAG AAT A I G A 


250 


TGA1 1G1 rGT 


TTGTGCTACC 


300 


I 1 GG 1 1 CI GG 


G TCGTCAAGl 


350 


T< A A A /""T» /"'/'"' A HP 

T AAAG I CGAT 


ATGGTCGAGC 


400 


TGCG TGAGCT 


ACTCTCCGAA 


450 


Gil AGCGGCA 


G TGCTTTATG 


c r\ r\ 

500 


1G1 GAA1AG1 


7V 1 1 11 1 1 7V /"irp TV 7V TV rn 

ATTACTAAAT 


r r n 

550 


TTCCTGAAAG 


AAAAACGGAT 


600 


TTTTCAATTT 


CAGGTCGCGG 


650 


TACTTTAAAG 


AAGGGTGCTG 


700 


TAAAGACTAC 


CGTTACTGGA 


750 


GCCGTTGCCG 


GTGACAATTG 


800 


GCAATTAAAA 


CGTGGAATGA 


850 


ATCAGAAATT 


CAAGGCATCA 


900 


CGTCGTACCG 


GTTTCGTTGA 


950 


TTCCGACGTT 


ACTGTCGAAC 


1000 


AAATGGTTAT 


GCCTGGAGAC 


1050 


CCCATTGTCA 


TCGAAAAAGG 


1100 


CACTGTAGGC 


ACAGCTTTGG 


1150 


AACTTATTGG 


CTTTAAAAAT 


1200 


TTCTCAGAAT 


TCTATAACTA 


1250 


TCCATTGACA 


ATGGTATTTG 


1300 


AAAGTTATAA 


AA 


1342 



2) INFORMATION FOR SEQ ID NO: 2006 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2006 
TGGAGCCGGT GAGCGTGG 



2) INFORMATION FOR SEQ ID NO: 2007 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2007 
TGGAGCCAGT GAGCGTGG 



2) INFORMATION FOR SEQ ID NO: 2008 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2008 
TCTGGAGCCG ATGAGCGTG 



2) INFORMATION FOR SEQ ID NO: 2009 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2009 
CTGGAGCCAG TAAGCGTGG 



2) INFORMATION FOR SEQ ID NO: 2010 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: KMK107 

(C) ACCESSION NUMBER: AF027199 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2010 

ATGAGTATTC AACATTTTCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 50 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 100 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC 150 

AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 200 

GAGCACTTTT AAAGTTCTGC TATGTGGTGC GGTATTATCC CGTGTTGACG 250 

CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 300 

GTTAAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 350 

AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCTGCCA 400 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 450 

CACAACATGG GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT 500 

GAATGAAGCC ATAC CAAACG ACGAGCGTGA CACCACGACG CCTGCAGCAA 550 

TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 600 

TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 650 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 700 

GAGCCAGTGA GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT 750 

GGTAAGCCCT CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC 800 

TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 850 

AGCATTGGTA A 861 



2) INFORMATION FOR SEQ ID NO: 2011 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 861 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae 

(B) STRAIN: CLSiS L-491 

(C) ACCESSION NUMBER: • 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2011 

ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 
TTGCTTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 
CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAGCT GGATCTCAAC 
AGCGGTAAGA TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT 
GAGCACTTTT AAAGTTCTGC TATGTGGTGC GGTATTATCC CGTGTTGACG 
CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA GAATGACTTG 
GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 
AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 
ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG 
CACAACATGG GGGATCATGT AACCCGCCTT GATCGTTGGG AACCGGAGCT 
GAATGAAGCC ATACCAAACG ACGAGCGTGA CACCACGATG CCTGCAGCAA 
TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT TACTCTAGCT 

1047 
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TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 
ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 
GAGCCAGTAA GCGTGGATCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT 
GGTAAGCCCT CCCGTATCGT AGTTATCTAC ATGACGGGGA GTCAGGCAAC 
TATGGATGAA CGAAATAGAC AGATCGCTGA GATAGGTGCC TCACTGATTA 
AGCATTGGTA A 



650 
700 
750 
800 
850 
861 



2) INFORMATION FOR SEQ ID NO: 2012 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2012 
CCGCGGATTA TTAAACCGCC CTTCCGCGG-MR-HEG-ATGTCAGAGG GATAGATCCA 4 9 



2) INFORMATION FOR SEQ ID NO: 2013 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 353 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera ascorbata 

(B) STRAIN: ATCC 33433 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2013 

AGCTTAAGAA CTCTTATCTG GATTACGCGA TGTCGGTCAT TGTTGGCCGT 50 

GCGCTGCCGG ATGTCCGAGA TGGCCTGAAG CCGGTACACC GTCGCGTACT 100 

TTACGCCATG AACGTATTGG GCAATGACTG GAACAAAGCC TACAAAAAAT 150 

CAGCCCGTGT CGTGGGTGAC GTGATCGGTA AATATCACCC GCATGGTGAT 200 

ACTGCCGTCT ATGACACTAT CGTCCGTATG GCACAGCCAT TCTCACTGCG 250 

ATACATGCTG GTAGATGGTC AAGGTAACTT CGGTTCTGTC- GATGGCGACT 300 

CCGCCGCAGC GATGCGTTAT ACGGAAATCC GTATGTCGAA AATCGCCCAT 350 

GAG 353 



2) INFORMATION FOR SEQ ID NO: 2014 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 355 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kluyvera georgiana 

(B) STRAIN: ATCC 51603 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2014 

AGCTCCTATC TGGATTATGC GATGTCGGTC ATTGTTGGCC GTGCGCTGCC 50 

AGATGTCCGA GATGGCCTGA AGCCGGTACA CCGTCGCGTA CTTTACGCCA 100 

TGAACGTACT AGGCAATGAC TGGAACAAAG CCTATAAAAA ATCTGCCCGT 150 

GTCGTTGGTG ACGTAATCGG TAAATACCAT CCCCATGGTG ACTCGGCGGT 200 

CTATGACACG ATCGTCCGCA TGGCGCAGCC ATTCTCGCTG CGTTATATGC 250 

TGGTAGACGG TCAGGGTAAC TTCGGTTCTA TCGACGGCGA CTCTGCGGCG 300 

GCAATGCGTT ATACGGAAAT CCGTCTGGCG AAAATTGCCC ATGAACTGAT 350 

GGCCG 355 



2) INFORMATION FOR SEQ ID NO: 2015 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA , 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2015 
CCAAGAAGCT CAAAAACATC TG 



2) INFORMATION FOR SEQ ID NO: 2016 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2016 

VI 

TADCCTGTCC AWACAGCCAT 



2) INFORMATION FOR SEQ ID NO: 2017 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2017 
ACTTTGAATA AGGTCGGTCT AG 



2) INFORMATION FOR SEQ ID NO: 2018 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2018 
ACACTAAACA AGGTTGGTTT AG 



2) INFORMATION FOR SEQ ID NO: 2019 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2019 
ACACTAAACA AGGTCGGTCT AG 



2) INFORMATION FOR SEQ ID NO: 2020 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2020 
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2) INFORMATION FOR SEQ ID NO: 2021 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2021 
GTAGCTCCAG ACGAAATGTT TG 



2) INFORMATION FOR SEQ ID NO: 2022 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2022 
GTAGCTCCAG ATGAAACGTT TG 



2) INFORMATION FOR SEQ ID NO: 2023 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2023 
GTAACTCCAG ATGAAATGTT TG 



2) INFORMATION FOR SEQ ID NO: 2024 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 



(ii) 



MOLECULE TYPE: DNA 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 2024 



AGTGAAAAGA TGGCTGCTGC 



20 



2) INFORMATION FOR SEQ ID NO: 2025 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2025 
AGTGAGAAAA TGGCTGCTGC 20 



2) INFORMATION FOR SEQ ID NO: 2026 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2026 
TCCAAGCATG CATTATGCAA ACG 23 



2) INFORMATION FOR SEQ ID NO: 2027 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2027 
TCGGTCTAGA TAGAGCTAAA ACG 23 
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2) INFORMATION FOR SEQ ID NO: 2028 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2028 
TATGCTCTTC AACAATCACG 20 



2) INFORMATION FOR SEQ ID NO: 2029 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA . 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2029 
AGCCGTTGAG ACTTTGAATA AG 22 



2) INFORMATION FOR SEQ ID NO: 2030 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2030 
CTTAATGGTC TTGGTATCG 19 



2) INFORMATION FOR SEQ ID NO: 2031 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2031 
CGTGACTGGG GTTCTGCTAT GA 

2) INFORMATION FOR SEQ ID NO: 2032 

(i) SEQUENCE CHARACTERISTICS: 

(A) . LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2032 
CGTGACTGGG GATCATCAAT GA 22 
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22 



2) INFORMATION FOR SEQ ID NO: 2033 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2033 
CGTGACTGGG GTTCTGCCAT GA 22 



2) INFORMATION FOR SEQ ID NO: 2034 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2034 
ATCAAGAACA CTGGCTATGT AG 22 
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2) INFORMATION FOR SEQ ID NO: 2035 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2035 
ATCAAGAACA CTGGCTACGT AG 22 



2) INFORMATION FOR SEQ ID NO: 2036 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2036 
ATCAAGAACA CTGGTTACGT AG 22 



2) INFORMATION FOR SEQ ID NO: 2037 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2037 
ATCAAAAATA CTGGTTATGT AG 22 



2) INFORMATION FOR SEQ ID NO: 2038 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2038 

ATCAAGAATA CTGGCTACGT AG 

2) INFORMATION FOR SEQ ID NO: 2039 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2039 
ATCAAAAACA CTGGCTATGT AG 



2) INFORMATION FOR SEQ ID NO: 2040 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 19 bases 
. (B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2040 
TGTGACCCCA GACAAACCC 



2) INFORMATION FOR SEQ ID NO: 2041 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2041 
GTTGAGCGGC AGCACTATCT 



2) INFORMATION FOR SEQ ID NO: 2042 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2042 
CACGGGGATT TCTCTATTTA 20 



2) INFORMATION FOR SEQ ID NO: 2043 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2043 
CACGGGGATT ACTCTATTTA 20 



2) INFORMATION FOR SEQ ID NO: 2044 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2044 
ACCGTAAGTC GGCCAAGTCA , 20 



2) INFORMATION FOR SEQ ID NO: 2045 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2045 
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GTTCTTTCTC CGTATCGTC 19 



2) INFORMATION FOR SEQ ID NO: 2046 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2046 
ACGGGGATTT TTCTATCTAT 20 



2) INFORMATION FOR SEQ ID NO: 2047 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: CS109 

(C) ACCESSION NUMBER: Z49094 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2047 

ATGAACAAAC CAACGATTCT GCGCCTAATC AAGTATCTGA GCATTAGCTT 50 

CTTAAGCTTG GTTATCGCAG CCATTGTCTT AGGCGGAGGA GTTTTTTTCT 100 

ACTATGTTAG CAAGGCTCCT AGCCTATCCG AGAGTAAACT AGTTGCAACA 150 

ACCTCTAGTA AAATCTACGA CAATAAAAAT CAACTCATTG CTGACTTGGG 200 

TTCTGAACGC CGCGTCAATG CCCAAGCTAA TGATATTCCC ACAGATTTGG 250 

TTAAGGCAAT CGTTTCTATC GAAGACCATC GCTTCTTCGA CCACAGGGGG 300 

ATTGATACCA TCCGTATCCT GGGAGCTTTC TTGCGCAATC TGCAAAGCAA 3 50 

TTCCCTCCAA GGTGGATCAA CTCTCACCCA ACAGTTGATT AAATTGACTT 400 

ACTTTTCAAC CTCGACTTCC GACCAGACTA TTTCTCGTAA GGCTCAGGAA 4 50 

GCTTGGTTAG CGATTCAGTT AGAACAAAAA GCAACCAAAC AGGAAATCTT 500 

GACCTACTAT ATAAATAAGG TCTACATGTC TAATGGCAAC TATGGAATGC 550 

AGACAGCAGC TCAAAACTAC TATGGTAAAG ACCTCAATAA TTTAAGTTTA 600 

CCTCAGTTAG CCTTGCTGGC TGGAATGCCT CAGGCACCAA ACCAATATGA 650 

CCCCTATTCA CATCCAGAAG CAGCCCAAGA CCGCCGAAAC TTGGTCTTAT 700 

CTGAAATGAA AAATCAAGGT TACATCTCTG CTGAACAGTA TGAGAAAGCA 750 

GTCAATACAC CAATTACTGA TGGACTACAA AGTCTCAAAT CAGCAAGTAA 800 

TTACCCTGCT TACATGGATA ATTACCTCAA GGAAGTCATC AATCAAGTTG 850 

AAGAAGAAAC TGGCTATAAC CTTCTAACTA CTGGGATGGA TGTTTACACA 900 
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AATGTAGACC AAGAAGCTCA AAAACATCTG 
TCAATACGTC TCTTACCCTG ACGATGATTT 
TAGATGTTTC AAATGGTAAA GTCATCGCAC 
GCAAGTAATG TTTCATTCGG TACCAACCAG 
CTGGGGATCA TCAATGAAAC CAATCACTGA 
ATGGAGTCTA TGACTCTACT GCTTCTATTG 
TATCCTGGCA CTGATACTCC ACTCTACAAC 
AAACATTACA ATCCAGTATG CTCTTCAACA 
TTGAGACTTT GAATAAGGTC GGTCTAGATA 
GGTCTTGGTA TCGACTATCC AAGCATGCAT 
TAACACAACT GAATCCAACA AAAAATATGG 
CTGCTGCCTA CGCTGCTTTT GCTAATGGTG 
TATATCAATA AAATCGTCTT TAGTGATGGT 
TGCTGGTACA CGAGCTATGA AAGAGACTAC 
TGATGAAAAC TGTTTTAACT TACGGAACAG 
TGGCTTCCAC AAGCAGGTAA GACAGGTACT 
AATTGAAAAG TATATCAAGA ACACTGGTTA 
TTGTAGGGTA TACCCGTAAA TATGCAATGG 
AATCGTCTAA CTCCAATCAT CGGAGATGGT 
CTATCGTTCA ATGATAACTT ACCTTTCTGA 
GGACAATGCC AGATGGCTTG TATAGAAATG 
GGTGCTCGTT CTACGTGGAG CTCACCTGCT 
TGAAAGTTCA AGCTCATCAT CAGATAGTTC 
CCACTCCAAG CACAAATAAT AGTACGACTA 
CAACAATCAA ATACAACCCC TGATCAACAA 
ACAACCATAA 
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TGGGATATCT ACAACTCCGA 950 

GCAAGTCGCA TCTACGGTCG 1000 

AACTTGGTGC TCGTCATCAA 1050 

GCCGTAGAAA CCAATCGTGA 1100 

CTATGCTCCC GCTTTAGAAT 1150 

TACATGATGT CCCTTATAAC 1200 

TGGGATCATG TCTACTTTGG 1250 

ATCACGAAAT GTCACAGCCG 1300 

GAGCTAAAAC CTTCCTTAAT 1350 

TATGCAAACG CCATTTCAAG 1400 

TGCAAGTAGT GAAAAAATGG 1450 

GTATTTATCA CAAACCAATG 1500 

AGCGAAAAAG AATTTTCTGA 1550 

TGCCTATATG ATGACTGAAA 1600 

GACGTGGAGC CTACCTACCA 1650 

TCTAACTATA CTGACGAAGA 1700 

CGTAGCTCCA GATGAAATGT 1750 

CTGTTTGGAC AGGATACTCA 1800 

TTCCTTGTTG CTGGTAAAGT 1850 

AGATG AC CAA CCTGGAGATT 1900 

GAGAATTCGT ATTTAAAAAT 1950 

CCACAACAAC CCCCATCAAC 2000 

AACTTCACAG TCTAACTCAA 2050 

CCAATCCTAA CAATAATACG 2100 

AATCAGAATC CTCAACCAGC 2150 

2160 



2) INFORMATION FOR SEQ ID NO: 2048 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: R6 

(C) ACCESSION NUMBER: M90527 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2048 

ATGAACAAAC CAACGATTCT GCGCCTAATC AAGTATCTGA GCATTAGCTT 50 

CTTAAGCTTG GTTATCGCAG CCATTGTCTT AGGCGGAGGA GTTTTTTTCT 100 

ACTACGTTAG CAAGGCTCCT AGCCTATCCG AGAGTAAACT AGTTGCAACA 150 

ACTTCTAGTA AAATCTACGA CAATAAAAAT CAACTCATTG CTGACTTGGG 2 00 

TTCTGAACGC CGCGTCAATG CCCAAGCTAA TGATATTCCC ACAGATTTGG 250 

TTAAGGCAAT CGTTTCTATC GAAGACCATC GCTTCTTCGA CCACAGGGGG 300 

ATTGATACCA TCCGTATCCT GGGAGCTTTC TTGCGCAATC TGCAAAGCAA 350 

TTCCCTCCAA GGTGGAT C AG CTCTCACTCA ACAGTTGATT AAGTTGACTT 400 

ACTTTTCAAC TTCGACTTCC GACCAGACTA TTTCTCGTAA GGCTCAGGAA 450 

GCTTGGTTAG CGATTCAGTT AGAACAAAAA GCAACCAAGC AAGAAATCTT 500 

GACCTACTAT ATAAATAAGG TCTACATGTC TAATGGGAAC TATGGAATGC 550 
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AGACAGCAGC TCAAAACTAC TATGGTAAAG 
CCTCAGTTAG CCTTGCTGGC TGGAATGCCT 
CCCCTATTCA CATCCAGAAG CAGCCCAAGA 
CTGAAATGAA AAATCAAGGC TACATCTCTG 
GTCAATACAC CAATTACTGA TGGGCTACAA 
TTACCCTGCT TACATGGATA ATTACCTCAA 
AAGAAGAAAC AGGCTATAAC CTACTCACAA 
AATGTAGACC AAGAAGCTCA AAAACATCTG 
CGAATACGTT GCCTATCCAG ACGATGAATT 
TTGATGTTTC TAACGGTAAA GTCATTGCCC 
TCAAGTAATG TTTCCTTCGG AATTAACCAA 
CTGGGGATCA ACTATGAAAC CGATCACAGA 
ACGGTGTCTA CGAGTCAACT GCCACTATCG 
TACCCTGGGA CAAATACCCC TGTTTATAAC 
CAACATCACC TTGCAATACG CCCTGCAACA 
TGGAAACTCT AAACAAGGTC GGACTCAACC 
GGTCTCGGAA TCGACTACCC AAGTATTCAC 
TAACACAACC GAATCAGACA AAAAATATGG 
CTGCTGCTTA CGCTGCCTTT GCAAATGGTG 
TATATCCATA AAGTCGTCTT TAGTGATGGG 
TGTCGGAACT CGTGCCATGA AAGAAACGAC 
TGATGAAAAC AGTCTTGAGT TATGGAACTG 
TGGCTCCCTC AGGCTGGTAA AACAGGAACC 
AATTGAAAAC CACATCAAGA CCTCTCAATT 
TTGCTGGCTA TACGCGTAAA TATTCAATGG 
AACCGTCTGA CACCACTTGT AGGCAATGGC 
TTACCGCTCT ATGATGACCT ACCTGTCTGA 
GGAATATACC AGAGGGGCTC TACAGAAATG 
GGTGCTCGTT CTACGTGGAG CTCACCTGCT 
TGAAAGTTCA AGCTCATCAT CAGATAGTTC 
CCACTCCAAG CACAAATAAT AGTACGACTA 
CAACAATCAA ATACAACCCC TGATCAACAA 
ACAACCATAA 
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ACCTCAATAA TTTAAGTTTA 600 

CAGGCACCAA ACCAATATGA 650 

CCGCCGAAAC TTGGTCTTAT 700 

CTGAACAGTA TGAGAAAGCA 750 

AGTCTCAAAT CAGCAAGTAA 800 

GGAAGTCATC AATCAAGTTG 850 

CTGGGATGGA TGTCTACACA 900 

TGGGATATTT ACAATACAGA 950 

GCAAGTCGCT TCTACCATTG 1000 

AGCTAGGAGC ACGCCATCAG 1050 

GCAGTAGAAA CAAACCGCGA 1100 

CTATGCTCCT GCCTTGGAGT 1150 

TTCACGATGA GCC CTATAAC 1200 

TGGGATAGGG GCTACTTTGG 12 50 

ATCGCGAAAC GTCCCAGCCG 1300 

GCGCCAAGAC TTTCCTAAAT 1350 

TACTCAAATG CCATTTCAAG 1400 

AGCAAGTAGT GAAAAGATGG 1450 

GAACTTACTA TAAACCAATG 1500 

AGTGAAAAAG AGTTCTCTAA 1550 

AGCCTATATG ATGAC CGACA 1600 

GACGAAATGC CTATCTTGCT 1650 

TCTAACTATA CAGACGAGGA 1700 

TGTAGCACCT GATGAACTAT 1750 

CTGTATGGAC AGGCTATTCT 1800 

CTTACGGTCG CTGCCAAAGT 1850 

AGGAAGCAAT CCAGAGGATT 1900 

GAGAATTCGT ATTTAAAAAT 1950 

CCACAACAAC CCCCATCAAC 2000 

AACTTCACAG TCTAGCTCAA 2050 

CCAATCCTAA CAATAATACG 2100 

AATCAGAATC CTCAACCAGC 2150 

2160 



2) INFORMATION FOR SEQ ID NO: 2049 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2160 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: URU-E159 

(C) ACCESSION NUMBER: AF139890 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 204 9 

ATGAACAAAC CAACGATTCT GCGCCTAATC AAGTATCTGA GCATTAGCTT 50 

CTTAAGCTTG GTTATCGCAG CCATTGTCTT- AGGCGGAGGA GTTTTTTTCT 100 

ACTACGTTAG CAAGGCTCCT AGCCTATCCG AGAGTAAACT AGTTGCAACA 150 

ACTTCTAGTA AAATCTACGA CAATAAAAAT CAACTCATTG CTGACTTGGG 200 
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TTCTGAACGC CGCGTCAATG CCCAAGCTAA TGATATTCCC ACAGATTTGG 250 

TTAAGGCAAT CGTTTCTATC GAAGACCATC GCTTCTTCGA CCACAGGGGG 3 00 

ATTGATTCCA TCCGTATCCT GGGAGCTTTC TTGCGCAATC TGCAAAGTAA 350 

TTCCCTCCAA GGTGGATCAA CTCTCACCCA ACAGTTGATT AAGTTGACTT 400 

ACTTTTCAAC CTCGACTTCC GACCAGACTA TTTCTCGTAA GGCTCAGGAA 450 

GCTTGGTTAG CGATTCAGTT AGAACAAAAA GCAACCAAAC AGGAAATCTT 500 

GACCTACTAT ATAAATAAGG TCTACATGTC TAATGGCAAC TATGGAATGC 550 

AGACAGCAGC TCAAAACTAC TATGGTAAAG ACCTCAATAA TTTAAGTTTA 600 

CCTCAGTTAG CCTTGCTGGC TGGAATGCCT CAGGCACCAA ACCAATATGA 650 

CCCCTATTCA CATCCAGAAG CAGCCCAAGA CCGCCGAAAC TTGGTCTTAT 700 

CTGAAATGAA AAATCAAGGT TACATCTCTG CTGAACAGTA TGAGAAAGCA 750 

GTCAATACAC CAATTACTGA TGGACTACAA AGTCTCAAAT CAGCAAGTAA 800 

TTACCCTGCT TACATGGATA ATTACCTCAA GGAGGTCATC AATCAAGTAG 850 

AACAAGAAAC TGGCTATAAC CTTCTAACTA CTGGGATGGA TGTTTACACA 900 

AATGTAGACC AAGAAGCTCA AAAACATCTG TGGGATATCT ACAACTCCGA 950 

TCAATACGTC TCTTACCCTG ACGATGATTT GCAAGTCGCA TCTACGGTCG 1000 

TAGATGTTTC AAATGGTAAA GTCATCGCCC AACTTGGAGC TCGTCACCAA 1050 

GCAAGTAACG TTTCATTTGG TACCAACCAA GCTGTGGAAA CCAATCGTGA 1100 

CTGGGGTTCT GCTATGAAAC CAATCACCGA TTATGCACCT GCCATAGAAT 1150 

ACGGTGTTTA TGATTCCACT GCAACTATGG TTAATGATAT TCCTTATAAC 1200 

TATCCGGGAA CAAGCACACC TGTCTACAAC TGGGATAGAG CATATTTCGG 1250 

TAATATTACT CTGCAATATG CTCTTCAACA ATCACGAAAT GTCACAGCCG 13 00 

TTGAGACTTT GAATAAGGTC GGT CTAGATA GAGCTAAAAC CTTCCTTAAT 1350 

GGTCTTGGTA TCGACTATCC AAGCATGCAT TATGCAAACG CCATTTCAAG 1400 

TAATACAACA GAATCTAATA AACAATACGG AGCAAGTAGT GAAAAAATGG 14 50 

CTGCTGCTTA TGCTGCCTTT GCAAATGGTG GCACTTACTA TAAACCAATG 1500 

TATATCCATA AAGTCGTCTT CAGTGATGGA AGTAAAAAAG AGTTCTCTAA 1550 
TGTCGGAACT CGTGCCATGA AGGAAACGAC AGCCTATATG ATGACCGACA . 1600 

TGATGAAAAC AGTCTTGACT TATGGAACTG GGCGTGGAGC CTATCTTCCT 1650 

TGGCTTCCTC AAGCTGGTAA AACAGGAACC TCTAACTATA CAGATGAGGA 1700 

AGTTGAAAAC CACATCAAGA ACACTGGCTA TGTAGCTCCA GATGAAATGT 1750 

TTGTTGGTTA TACTCGTAAG TATTCTATGG CTGTATGGAC AGGTTATTCG 1800 

AATCGTTTAA CTCCTATCGT TGGAGATGGT TTCCTAGTTG CAGCTAAAGT 1850 

TTATCGCTCA ATGATAACGT ATCTATCAGA AGATACTCAT CCAGAAGACT 1900 

GGACGATGCC AGACGGACTT TTCAGAAATG GAGAATT CGT ATTTAAAAAT 1950 

GGTGCTCGTT CTACGTGGAA CTCACCTGCT CCACAACAAC CCCCATCAAC 2000 

TGAAAGTTCA AGCTCATCAT GAGATAGTTC AACTTCACAG TCTAGCTCAA 2050 

CCACTCCAAG CACAAATAAT AGTACGACTA CCAATCCTAA CAATAATACG 2100 

CAACAATCAA ATACAACCCC TGATCAACAA AATCAGAATC CTCAACCAGC 2150 

ACAACCATAA 2160 



2) INFORMATION FOR SEQ ID NO: 2050 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 930 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 8303; 35193 

(C) ACCESSION NUMBER: AF046230 

1061 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2050 

CTGTGGGATA TCTACAACTC CGATCAATAC GTCTCTTACC CTGACGATGA 50 

TTTGCAAGTC GCATCTACGG TCGTAGATGT TTCAAATGGT AAAGTCATCG 100 

CCCAACTTGG AGCTCGTCAC CAAGCAAGTA ACGTTTCATT TGGTACCAAC 150 

CAAGCTGTGG AAACCAATCG TGACTGGGGT TCTGCTATGA AACCAATCAC 200 

CGATTATGCA CCTGCCATAG AATACGGTGT TTATGATTCC ACTGCAACTA 250 

TGGTTAATGA TATTCCTTAT AACTATCCGG GAACAAGCAC ACCTGTCTAC 300 

AACTGGGATA GAGCATATTT CGGTAATATT ACTCTGCAAT ATGCTCTTCA 350 

ACAATCACGA AATGTCACAG CCGTTGAGAC TTTGAATAAG GTCGGTCTAG 400 

ATAGAGCTAA AACCTTCCTT AATGGTCTTG GTATCGACTA TCCAAGCATG 450 

CATTATGCAA ACGCCATTTC AAGTAATACA ACAGAATCTA ATAAACAATA 500 

CGGAGCAAGT AGTGAAAAAA TGGCTGCTGC TTATGCTGCC TTTGCAAATG 550 

GTGGCACTTA CTATAAACCA ATGTATATCC ATAAAGTCGT CTTCAGTGAT 600 

GGAAGTAAAA AAGAGTTCTC TAATGTCGGA ACTCGTGCCA TGAAGGAAAC 650 

GACAGCCTAT ATGATGACCG ACATGATGAA AACAGTCTTG ACTTATGGAA 700 

CTGGGCGTGG AGCCTATCTT CCTTGGCTTC CTCAAGCTGG TAAAACAGGA 750 

ACCTCTAACT ATACAGATGA GGAAGTTGAA AACCACATCA AGAACACTGG 800 

CTATGTAGCT CCAGATGAAA TGTTTGTTGG TTATACT CGT AAGTATTCTA 850 

TGGCTGTATG GACAGGTTAT TCGAATCGTT TAACTCCTAT CGTTGGAGAT 900 

GGTTTCCTAG TTGCAGCTAA AGTTTATCGC 930 



2) INFORMATION FOR SEQ ID NO: 2051 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 930 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 63509; Mil 

(C) ACCESSION NUMBER: AF04 6238 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2051 

CTGTGGGATA TTTACAATAC AGACGAATAC GTTGCCTATC CAGATGATGA 50 

CATGCAAGTA GCTTCGACTA TTGTAGATGT ATCTAATGGT AACGTTATTG 100 

CACAACTTGG TGCTCGTCAT CAAGCAAGTA ATGTTTCATT CGGCACCAAC 150 

CAGGCTGTGG AGAC CAATCG TGACTGGGGT TCTTCTATGA AACCAATCAC 200 

TGACTATGCT CCCGCTTTAG AATATGGAGT CTATGACTCT ACTGCTTCTA 250 

TTGTACATGA TGTTCCTTAT AACTATCCTG GCACTGATAC TCCAGTCTAC 300 

AACTGGGATC ATGTCTACTT TGGAAACATT ACAATCCAGT ATGCTCTTCA 350 

ACAATCACGA AATGTCACAG CCGTTGAGAC TTTGAATAAG GTCGGTCTAG 400 

ATAGAGCTAA AACCTTCCTT AATGGTCTTG GTATCGACTA TCCAAGCATG 450 

CATTATGCAA ACGCCATTTC AAGTAACACA ACTGAATCCA ACAAACAGTA 500 

CGGTGCAAGT AGTGAAAAGA TGGCTGCTGC TTATGCCGCC TTTTCTAATG 550 

GTGGTATTTA CCACAAACCA ATGTATATCA ATAAAATCGT CTTCAGTGAT 600 

GGTAGTGAAA AAGAATTTTC TGATGCTGGT ACACGAGCTA TGAAAGAAAC 650 

TACTGCCTAT ATGATGACCG AAATGATGAA AACTGTTTTA GCTTACGGAA 700 

CCGGACGTGG TGCTTACCTA CCATGGCTTC CACAAGCAGG TAAGACAGGT 750 
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ACTTCTAACT ATACTGACGA CGAAATTGAA 
CTACGTAGCT CCAGATGAAA TGTTTGTTGG 
TGGCTGTATG GACTGGTTAC TCAAATCGTT 
GGTTTCCTAG TTGCTGCTAA GGTTTATCGC 
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AAGTATATCA AGAACACTGG 800 
TTATACTCGT AAGTATTCTA 850 
TAACTCCAAT CGTAGGAGAT 900 

930 



2) INFORMATION FOR SEQ ID NO: 2052 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1195 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: #22/HA5 

(C) ACCESSION NUMBER: AB006877 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2052 

GTCTCTTACC CTGACGATGA TTTGCAAGTC GCAT CTACGG TCGTAGATGT 50 

TTCCAATGGT AAAGTCATCG CCCAACTTGG AGCTCGTCAC CAAGCAAGTA 100 

ACGTTTCATT TGGTACCAAC CAAGCTGTGG AAACCAATCG TGACTGGGGT 150 

TCAACAATGA AACCAATCAC CGATTATGCA CCTGCCATAG AATACGGTGT 200 

ATATGATTCC ACTGCAACTA TGGTTAATGA TATTCCTTAT AACTATCCGG 250 

GAACAAGCAC ACCTGTCTAC AACTGGGATC GAGCATATTT TGGTAATATT 300 

ACTCTGCAAT ATGCCCTTCA ACAATCACGT AATGTTACAG CAGTTGAAAC 350 

ATTAAACAAG GTTGGTTTAG ATCGAGCCAA AACTTTCCTA AATGGCTTAG 400 

GTATCGATTA CCCAAGCATA CACTATG CTA ACGCCATTTC AAGTAATACA 450 

ACAGAATCTA ACAAACATTA CGGTGCAAGT AGTGAAAAAA TGGCTGCTGC 500 

TTATGCCGCC TTTGCTAATG GTGGTATTTA CCACAAACCA ATGTATATCA 550 

ATAAAATCGT CTTTAGTGAT GGTAGTGAGA AAGAATTTTC TGATGCTGGT 600 

ACACGAGCTA TGAAAGAAAC TACTGCCTAT ATGATG AC CG AAATGATGAA 650 

AACTGTTTTA GCTTACGGAA CCGGACGTGG TGCTTACCTA CCATGGCTTC 700 

CACAAGCAGG TAAGACAGGT ACTTCTAACT ATACTGATGA CGAAATCGAA 750 

AAACATATCA AGAATACTGG CTACGTAGCT CCAGATGAAA TGTTTGTAGG 800 

CTATACTCGT AAGTATTCTA TGGCTGTATG GACTGGTTAC TCAAATCGTT 850 

TAACTCCAAT TGTTGGAGAT GGTTTCCTAG TTGCTGCTAA GGTTTATCGC 900 

TCAATGATAT CGTATCTATC AGAAGATGAC CAACCTGGAG ATTGGACGAT 950 

GCCAGACGGC CTGTTCCGAA ATGGAGAATT CGTATTTAAA AATGGTGCTC 1000 

GTTCTACGTG GAACTCACCT GCTCCACAAC AACCCCCATC AACTGAAAGT 1050 

TCAAGCTCAT CATCAGATAG TTCAACTTCA CAGTCTAGCT CAACCACTCC 1100 

AAGCACAAAT AATAGTACGA CTACCAATCC TAACAATAAT ACGCAACAAT 1150 

CAAATACAAC CCCTGATCAA CAAAATCAGA ATCCTCAACC AGCAC 1195 



2) INFORMATION FOR SEQ ID NO: 2053 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 930 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 17619 

(C) ACCESSION NUMBER: AF046237 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2053 

CTGTGGGATA TTTACAATAC AGACGAATAC GTTGCCTATC CAGACGATGA 50 

ATTGCAAGTC GCTTCTACCA TTGTTGATGT TTCTAACGGT AAAGTCATTG 100 

CCCAGCTAGG AGCACGCCAT CAGTCAAGTA ATGTTTCCTT CGGAATTAAC 150 

CAAGCAGTAG AAACAAACCG CGACTGGGGA TCAACTATGA AACCGATCAC 200 

AGACTATGCT CCTGCCTTGG AGTACGGTGT CTACGATTCA ACTGCTACTA 250 

TCGTTCACGA TGAGCCCTAT AACTACCCTG GGACAAATAC TCCTGTTTAT 300 

AACTGGGATA GGGGCTACTT TGGCAACATC ACCTTGCAAT ACGCCCTGCA 350 

ACAATCGCGA AACGTCCCAG CCGTGGAAAC TCTAAACAAG GTCGGACTCA 400 

ACCGCGCCAA GACTTTCCTA AATGGTCTCG GAATCGACTA CCCAAGTATT 450 

CACTACTCAA ATGCCATTTC AAGTAACACA ACCGAATCAG ACAAAAAATA 500 

TGGAGCAAGT AGTGAAAAGA TGGCTGCTGC TTACGCTGCC TTTGCAAATG 550 

GTGGAACTTA CTATAAACCA ATGTATATCC ATAAAGTCGT CTTTAGTGAT 600 

GGGAGTGAAA AAGAGTTCTC TAATGTCGGA ACTCGTGCCA TGAAGGAAAC 650 

GACAGCCTAT ATGATGACCG AAATGATGAA AACAGTCTTG AGTTATGGAA 700 

CTGGACGAAA TGCCTATCTT GCTTGGCTTC CTCAAGCTGG TAAAACAGGT 750 

ACCTCTAACT ATACAGATGA AGAAATTGAA AAACACATCA AAAACACTGG 800 

CTATGTAGCT CCAGATGAAA CGTTTGTTGG TTATACTCGT AAGTATTCTA 850 

TGGCTGTATG GACAGGTTAC ACAAACCGTC TCACACCAAT TGTGGGGGAT 900 

GGCTTCACAG TTGCTGCCAA AGTTTACCGC 930 



2) INFORMATION FOR SEQ ID NO: 2054 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 306 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: R6 

(C) ACCESSION NUMBER: Y07845 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2054 

CGTCGCATTC TCTACGGAAT GAATGAATTG GGTGTGACCC CAGACAAACC 50 

CCATAAAAAA TCTGCTCGTA TTACAGGGGA TGTCATGGGT AAATACCACC 100 

CACACGGGGA TTCCTCTATT TATGAAGCCA TGGTCCGTAT GGCTCAATGG 150 

TGGAGCTACC GTTACATGCT TGTAGATGGT CATGGGAATT TTGGTTCCAT 200 

GGATGGAGAT AGTGCTGCCG CTCAACGTTA TACCGAGGCA CGTATGAGCA 250 

AGATTGCTCT GGAAATGCTT CGTGATATCA ACAAAAATAC AGTTGATTTC 3 00 

GTTGAT 306 
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2) INFORMATION FOR SEQ ID NO: 2055 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2472 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 7785 

(C) ACCESSION NUMBER: Z67739 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2055 

ATGTCTAACA TTCAAAACAT GTCCCTGGAG GACATCATGG GAGAGCGCTT 50 

TGGTCGCTAC TCCAAGTACA TTATTCAAGA CCGGGCTTTG CCAGATATTC 100 

GTGATGGGTT GAAGCCGGTT CAGCGCCGTA TTCTTTATTC TATGAATAAG 150 

GATAGCAATA CTTTTGACAA GAGCTACCGT AAGTCGGCCA AGTCAGTCGG 200 

GAACATCATG GGGAATTTCC ACCCACACGG GGATTCTTCT ATCTATGATG 250 

CCATGGTTCG TATGTCACAG AACTGGAAAA ATCGTGAGAT TCTAGTTGAA 300 

ATGCACGGTA ATAACGGTTC TATGGACGGA GATCCTCCTG CGGCTATGCG 350 

TTATACTGAG GCACGTTTGT CTGAAATTGC AGGCTACCTT CTTCAGGATA 400 

TCGAGAAAAA GACAGTTCCT TTTGCATGGA ACTTTGACGA TACGGAGAAA 450 

GAACCAACGG TCTTGCCAGC AGCCTTTCCA . AACCTCTTGG TCAATGGTTC 500 

GACTGGGATT TCGGCTGGTT ATGCCACAGA CATTCCTCCC CATAATTTAG 550 

CTGAGGTCAT AGATGCTGCA GTTTACATGA TTGACCACCC AACTGCAAAG 60 0 

ATTGATAAAC TCATGGAATT CTTACCTGGA CCAGACTTCC CTACAGGGGC 650 

TATTATTCAG GGTCGTGATG AAATCAAGAA AGCTTATGAG ACTGGGAAAG 700 

GGCGCGTGGT TGTTCGTTCC AAGACTGAAA TTGAAAAGCT AAAAGGTGGT 750 

AAGGAACAAA TCGTTATTAC TGAGATTCCT TATGAAATCA ATAAGGCCAA 800 

TCTAGTCAAG AAAATCGATG ATGTTCGTGT TAATAACAAG GTAGCTGGGA 850 

TTGCTGAGGT TCGTGATGAG TCTGACCGTG ATGGTCTTCG TATCGCTATC 900 

GAACTTAAGA AAGACGCTAA TACTGAGCTT GTTCTCAACT ACTTATTTAA 950 

GTACACCGAC CTACAAATCA ACTACAACTT TAATATGGTG GCGATTGACA 1000 

ATTTCACACC TCGTCAGGTT GGGATTGTTC CAATCCTGTC TAGCTACATC 1050 

GCTCACCGTC GAGAAGTGAT TTTGGCGCGT TCACGCTTTG ACAAAGAAAA 1100 

GGCTGAGAAA CGTCTCCATA TCGTCGAAGG TTTGATTCGT GTGATTTCGA 1150 

TTTTGGATGA AGTCATTGCT CTTATCCGTG CTTCTGAGAA TAAGGCGGAC 1200 

GCCAAGGAAA ACCTCAAAGT TAGCTATGAT TTTACGGAAG AACAGGCTGA 1250 

GGCTATCGTA ACTTTGCAAC TGTACCGTTT GACCAATACC GATGTGGTTG 1300 

TCTTGCAGGA AGAAGAAGCA GAGCTTCGTG AGAAGATTGC TATGCTGGCG 1350 

GCTATTATCG GTGATGAAAG GACTATGTAC AATCTCATGA AGAAAGAACT 1400 

TCGTGAGGTC AAGAAGAAAT TTGCAACTCC TCGTTTGAGT TCTTTAGAAG 1450 

ACACTGCGAA AGCAATTGAG ATTGATACAG CTAGTCTTAT CGCTGAGGAA 1500 

GATACCTACG TCAGCGTGAC CAAGGCAGGT TACATCAAGC GTACCAGTCC 1550 

ACGTTCCTTT GCGGCTTCCA CCTTGGAAGA AATTGGCAAG CGTGATGATG 1600 

ACCGTTTGAT TTTTGTTCAA TCTGCCAAGA CAACCCAGCA CCTCTTGATG 1650 

TTCACAAGTC TTGGAAATGT CATCTACAGA CCAATCCATG AGTTGGCAGA 1700 

TATTCGTTGG AAGGACATCG GAGAGCATCT GAGCCAAACC ATCACAAACT 1750 

TTGAAACGAA TGAAGAAATC CTTTATGTGG AAGTACTGGA TCAGTTTGAC 1800 

GATGCGACAA CCTACTTTGC AGTGACTCGC CTTGGTCAAA TCAAACGGGT 1850 
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AGAGCGAAAA GAATTCACTC CATGGCGGAC 
AGTATGCTAA GCTCAAAGAC GATACAGATC 
ATTAAACTAG ATGATGTTGT CTTGGTTAGT 
TTTCAATATC GAAGAGGTTC CGGTTGTCGG 
AGGCTATGAA TTTGAAAGAA GATGATGTCC 
AATACTTCGT CCTTCTACCT CTTGACCCAG 
TTCTATTGAG GAAATTCTAG CAACCAGCCG 
TCTTGCGTGA GTTGAAAAAC AAACCGCATC 
GTTGCAGAGC AAGGATTTGT TGGCGATTTC 
GAACGACCAA ACTCTGCTTG TCCAATCCAA 
GCCGATTGCA AGACTTGAAC TTGTCAGAAC 
ATTTCTGACA CGATTTCAGA TGAAGAAGTT 
AGTAGTTACT GAAGATAAAT AA 
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CTATAGATCT AAGTCTGTCA 1900 

AGATTGTAGC AGTGGCTCCG 1950 

CAAAATGGTT ATGCCCTGCG 2000 

TGCTAAGGCA GCAGGTGTCA 2050 

TCCAATCTGG CTTTATCTGT 2100 

CGTGGAAGCT TGAAACGTGT 2150 

TGCCAAACGA GGATTACAAG 2200 

GTGTCTTCTT GGCAGGAGCA 2250 

TTCAGTACGG AAGTGGATGT 2300 

TAAAGGAACA ATCTATGAAA 2350 

GCACTAGCAA TGGAAGCTTC 2400 

TTTGACGCTT ATCTTCAGGA 2450 

2472 



2) INFORMATION FOR SEQ ID NO: 2056 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1212 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-16 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2056 

AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

TGGTAAAGTC ATTGCCCAAC TTGGAGCTCG TCACCAAGCA AGTAACGTTT 150 

CATTTGGCAC CAACCAAGCT GTGGAAACCA ATCGTGACTG GGGTTCAACA 200 

ATGAAACCAA TCACAGACTA TGCTCCTGCC TTGGAATACG GTGTCTACGA 250 

TTCAACTGCT ACTATCGTTC ACGATGAGCC CTATAACTAT CCTGGGACAG 300 

ATACCCCTGT CTATAACTGG GATAGGGGCT ACTTTGGCAA TATCACCTTG 350 

CAATACGCCC TGCAACAATC GCGAAACGTC CCAGCCGTGG AAACACTAAA 400 

CAAGGTCGGG CTCAACCGCG CCAAGACTTT TCTAAATGGT CTCGGAATCG 450 

ACTACCCAAG T ATT C ACT AC TCAAATGCCA TTTCAAGTAA CACAACCGAG 500 

TCAGACAAAA AATATGGAGC AAGTAGTGAA AAGATGGCTG CTGCTTACGC 550 

TGCCTTTGCA AATGGTGGAA CTTACTATAA ACCAATGTAT ATCCATAAAG 600 

TCGTCTTTAG TGATGGAAGT GAAAAAGAGT TCTCTAATGT CGGAACTCGC 650 

GCCATGAAAG AAACGACTGC TTACATGATG ACAGAAATGA TGAAAACAGT 700 

CTTGACGTAC GGAATTGGTC GTGGTGCCTA CCTGCCTTGG CTTCCTCAAG 750 

CTGGTAAAAC AGGTACTTCT AACTATACTG ACGAAGAAAT TGAAAAGTAT 800 

ATCAAGAACA CTGGTTACGT AACTCCAGAT GAAATGTTTG TAGGGTATAC 850 

CCGTAAATAT GCAATGGCTG TATGGACAGG CTATTCTAAC CGTCTGACAC 900 

CACTTGTAGG CGATGGCCTT ACGGTCGCTG CCAAGGTTTA CCGCTCTATG 950 

ATGACCTACC TGTCTGAAGG AAGCAATCCA GAGGATTGGA ATATACCAGA 1000 

GGGGCTCTAC AGAAATGGAG AATTCGTATT TAAAAATGGT GCTCGTTCTA 1050 

CGTGGAGCTC ACCTGCTCCA CAACAACCCC CATCAACTGA AAGTTCAAGC 1100 

TCATCATCAG ATAGTTCAAC TTCACAGTCT AGCTCAACCA CTCCAAGCAC 1150 

AAATAATAGT ACGACTACCA ATCCTAACAA TAATACGCAA CAATCAAATA 1200 

CAACCCCTGA TC 1212 
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2) INFORMATION FOR SEQ ID NO: 2057 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1242 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-17 

■ 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2057 



ATGTAGACCA AGAAGCTCAA AAACATCTGT GGGATATTTA CAATACAGAC 50 

GAATACGTTG CCTATCCAGA CGATGAATTG CAAGTCGCTT CTACCATTGT 100 

TGATGTTTCT AACGGGAAAG TTATTGCTCA GTTAGGTTCT CGTCACCAAT 150 

CAAGCAATGT TTCCTTCGGA ATCAACCAAG CTGTTGAAAC CAACCGTGAC 200 

TGGGGTTCTG CCATGAAGCC AATCACAGAC TATGCTCCTG CCTTAGAGTA 250 

TGACATCTAC GACTCAACTG CTTCGATTGT ACATGATGTT CCTTATAACT 3 00 

ATCCAGGTAC TGATACTCCC CTCTACAACT GGGATAAAGT CTACTTTGGA 3 50 

AATATTACAA TCCAGTATGC ACTTCAACAG TCACGTAATG TCACAGCCGT 4 00 

TGAGACTTTG AATAAGGTCG GTCTAGATAG AGCTAAAACC TTTCTTAATG 450 

GTCTTGGTAT CGACTATCCA AGCATGCATT ATGCAAACGC CATTTCAAGT 500 

AATACGACTG AGTCAAACAA AAAGTACGGA GCAAGTAGTG AGAAAATGGC 550 

TGCTGCTTAC GCTGCTTTTG CTAACGGTGG TATCTACCAT AAACCAATGT 600 

ATATCAACAA AATCGTCTTT AGCGATGGTA GCTCAAAAGA ATACGCTGAT 650 

CCTGGTACTC GTGCCATGAA AGAGACGACC GCCTATATGA TGACAGAAAT 7 00 

GATGAAGACT GTCTTGGCAT ACGGAACGGG TCGTGGTGCT TATCTCCCTT 750 

GGCTACCTCA AGCTGGTAAG ACTGGTACAT CAAACTATAC AGATGATGAA 8 00 

ATTGAAAACT ACATCAAAAA TACTGGTTAT GTAGCTCCAG ACGAAATGTT 850 

TGTTGGTTAT ACTCGCAAAT ATTCAATGGC GGTATGGACA GGTTACTCAA 900 

ACCGCCTGAC TCCTATCGTT GGTGATGGCT TCTATGTTGC AGCTAAGGTT 950 

TACCGTTCAA TGATGACTTA TCTGTCTGAG GATAACAACC CTGGCGACTG 1000 

GACTATGCCA GAAGGTCTCT ATCGAAGTGG TGAGTTCGTC TTTAAAAAAG 1050 

GTGCTCGTTC TGCATGGACT GCTCCTGCTC CGCAACAGGC CCCAACACCA 1100 

GAAAGTTCGA GCTCGACATC AGAAAGTTCA ACTTCACAGT CAAGCTCAAC 1150 

TACTCCAAGC ACGAATAATA GTGCAAACAA TAATACCAAT AACCAGCAAC 1200 

CAAATACAAC GCCTGGTCAA CAAAACCAGA ACCAAAATCA GA 1242 



2) INFORMATION FOR SEQ ID NO: 2058 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1225 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-18 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2058 

GAAGCTCAAA AACATCTGTG GGATATTTAC AATACAGACG AATACGTTGC 50 
CTATCCAGAC GATGAATTGC AAGTCGCTTC TACCATTGTT GATGTTTCTA 100 
ACGGGAAAGT TATTGCTCAG TTAGGTTCTC GTCACCAATC AAGCAATGTT 150 
TCCTTCGGAA TCAACCAAGC TGTTGAAACC AACCGTGACT GGGGTTCTGC 200 
CATGAAGCCA ATCACAGACT ATGCTCCTGC CTTAGAGTAT GACATCTACG 250 
ACTCAACTGC TTCGATTGTA CATGATGTTC CTTATAACTA TCCAGGTACT 300 
GATACTCCCC TCTACAACTG GGATAAAGTC TACTTTGGAA ATATTACAAT 350 
CCAGTATGCA CTTCAACAGT CACGTAATGT CACAGCCGTT GAGACTTTGA 4 00 

ATAAGGTCGG TCTAGATAGA GCTAAAACCT TTCTTAATGG TCTTGGTATC 450 
GACTATCCAA GCATGCATTA TGCAAACGCC ATTTCAAGTA ATACGACTGA 500 
GTCAAACAAA AAGTACGGAG CAAGTAGTGA GAAAATGGCT GCTGCTTACG 550 
CTGCTTTTGC TAACGGTGGT ATCTACCATA AACCAATGTA TATCAACAAA 600 
ATCGTCTTTA GCGATGGTAG CTCAAAAGAA TACGCTGATC CTGGTACTCG 650 
TGCCATGAAA GAGACGACCG CCTATATGAT GACAGAAATG ATGAAGACTG 700 
TCTTGGCATA CGGAACGGGT CGTGGTGCTT ATCTCCCTTG GCTACCTCAA 750 
GCTGGTAAGA CTGGTACATC AAACTATACA GATGATGAAA TTGAAAACTA 800 
CATCAAAAAT ACTGGTTATG TAGCTCCAGA CGAAATGTTT GTTGGTTATA 850 
CTCGCAAATA TTCAATGGCG GTATGGACAG GTTACTCAAA CCGCCTGACT 900 
CCTATCGTTG GTGATGGCTT CTATGTTGCA GCTAAGGTTT ACCGTTCAAT 950 

GATGACTTAT CTGTCTGAGG ATAACAACCC TGGCGACTGG ACTATGCCAG 1000 

AAGGTCTCTA TCGAAGTGGT GAGTTCGTCT TTAAAAAAGG TGCTCGTTCT 1050 

GCATGGACTG CTCCTGCTCC GCAACAGGCC CCAACACCAG AAAGTTCGAG 1100 

CTCGACATCA GAAAGTTCAA CTTCACAGTC AAGCTCAACT ACTCCAAGCA 1150 

CGAATAATAG TGCAAACAAT AATACCAATA ACCAGCAACC AAATACAACG 1200 

CCTGGTCAAC AAAACCAGAA CCAAA 1225 



2) INFORMATION FOR SEQ ID NO: 2059 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 554 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-38 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2059 

TGGCTACTTT GGAAACATTA CAGTCCAGTA TGCTCTTCAA CAATCACGAA 50 

ATGTCACAGC CGTTGAGACT TTGAATAAGG TCGGTCTAGA TAGAGCTAAA 100 

ACCTTCCTCA ATGGACTTGG TATCGATTAT CCAAGCATGC TTTATGCAAA 150 

CGCCATTTCA AGTAACACAA CTGAATCCAA CAAAAAGTAC GGAGCAAGTA 200 

GTGAAAAAAT GGCCGCTGCC TACGCAGCTT TTGCTAATGG TGGTACTTAC 250 

CACAAACCAA TGTATATCAA TAAAATCGTC TTTAGTGATG GTAGTGAAAA 3 00 

AGAATTTTCT GATGCCGGTA CTCGGGCTAT GAAAGAAACT ACTGCCTATA 350 

1068 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

TGATGACCGA AATGATGAAA ACTGTCTTAT 
GCCTACCTAC CTTGGCTTCC ACAAGCAGGT 
TACTGACGAA GAAATTGAAA AGTATATCAA 
CAGATGAAAT GTTTGTTGGT TATACCCGCA 
ACAG 



PCT/CA00/01150 

TATACGGAAC CGGACGTGGA 4 00 

AAGACAGGTA CTTCTAACTA 450 
GAATGCTGGT TACGTAGCTC . 500 
AATATGCAAT GGCTGTTTGG 550 

554 



2) INFORMATION FOR SEQ ID NO: 2060 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1249 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-57 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2060 



CAACTGGGAT GGATGTCTAC ACAAATGTAG 
CTGTGGGATA TTTACAATAC AGACGAATAC 
ATTGCAAGTC GCTTCTACCA TTGTTGATGT 
CCCAGCTAGG AGCACGCCAT CAGTCAAGTA 
CAAGCAGTAG AAACAAACCG CGACTGGGGA 
AGACTATGCT CCTGCCTTGG AGTACGGTGT 
TCGTTCACGA TGAGCCCTAT AACTACCCTG 
AACTGGGATA GGGGCTACTT TGGCAACATC 
ACAATCGCGA AACGTCCCAG CCGTGGAAAC 
ACCGCGCCAA GACTTTCCTA AATGGTCTAG 
CACTACTCAA ATGCCATTTC AAGTAACACA 
TGGAGCAAGT AGTGAAAAGA TGGCTGCTGC 
GTGGAACTTA CTATAAACCA ATGTATATCC 
GGGAGTGAAA AAGAGTTCTC TAATGTCGGA 
GACAGCCTAT ATGATGACCG ACATGATGAA 
CTGGACGAAA TGCCTATCTT GCTTGGCTCC 
ACCTCTAACT ATACAGACGA GGAAATTGAA 
ATTTGTAGCA CCTGATGAAC TATTTGCTGG 
TGGCTGTATG GACAGGCTAT TCTAACCGTC 
GGCCTTACGG TCGCTGCCAA AGTTTACCGC 
TGAAGGAAGC AATCCAGAAG ATTGGAATAT 
ATGGAGAATT CGTATTTAAA AATGGTGCTC 
GCTCCACAAC AACCCCCATC AACTGAAAGT 
TTCAACTTCA CAGTCTAGCT CAACCACTCC 
CTACCAATCC TAACAATAAT ACGCAACAAT 



ACCAAGAAGC 


TCAAAAACAT 


50 


GTTGCCTATC 


CAGACGATGA 


100 


TTCTAACGGT 


AAAGTCATTG 


150 


ATGTTTCCTT 


CGGAATTAAC 


200 


TCAACTATGA 


AACCGATCAC 


250 


CTACGATTCA 


ACTGCTACTA 


300 


GGACAAATAC 


TCCTGTTTAT 


350 


ACCTTGCAAT 


ACGCCCTGCA 


400 


TCTAAACAAG 


GTCGGACTCA 


450 


GAATCGACTA 


CCCAAGTATT 


. 500 


ACCGAATCAG 


ACAAAAAATA 


550 


TTACGCTGCC 


TTTGCAAATG 


600 


ATAAAGTCGT 


CTTTAGTGAT 


650 


ACTCGTGCCA 


TGAAGGAAAC 


700 


AACAGTCTTG 


ACTTATGGAA 


750 


CTCAGGCTGG 


TAAAACAGGA 


800 


AACCACATCA 


AGACCTCTCA 


850 


CTATACGCGT 


AAATATTCAA 


900 


TGACACCACT 


TGTAGGCAAT 


950 


TCTATGATGA 


CCTACCTGTC 


1000 


ACCAGAGGGG 


CTCTACAGAA 


1050 


GTTCTACGTG 


GAGCTCACCT 


1100 


TCAAGCTCAT 


CATCAGATAG 


1150 


AAGCACAAAT 


AATAGTACGA 


1200 


CAAATACAAC 


CCCTGATCA 


1249 



2) INFORMATION FOR SEQ ID NO: 2061 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 579 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-60 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2061 

GAAGTTGAAC AGACTGGTCA CCAAGCTCCT TCATATCCGC CAACACTGCA 
ATTTTCTTGC CACCTTCATT GGCTGGAATG GCAGAGAAAG TCTCTAAAAT 
CAGTTTCATA GCAGTTGGAT TGGCATTATA AACATCTGAC AGGATATCTG 
CTCCATTGGC TGCTTTCTTC CACTCGGTAC GGTTACGCGT CAATTCAAGA 
TGTTGGAAGG CCAAACGAAT TTGCTCCTCT GAAACTCCTT CTTGCAAGGC 
AACATAGGAT GCAATCATAG CATTTGTCGC ATTGTACTTG CCAGTTACTG 
GCAAATCAAG GGCTTGCTCT AAGAAATTGA CCTTGAAGGT CAGACTATCT 
TTGCGCTCAA CCAAGTCGGT AATTCCCAGC TCTGCTCCTT GACCAAAACG 
AACCACCTTT TTATCAGTTG GCAAATAGTC CTCTACGATA GGGTCAGCCG 
GCGCTAAAAG CAAGGAACCT GAAGCCATTC CGTCTGCAAT TTGCATTTTT 
CCTTTAGCAA TCTCAGAACG GTCTTTGAAA AAGGCCAAAT GAGCTTCTCC 
AACCAAGGTC ACGATGGCTG TATGGACAG 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
579 



2) INFORMATION FOR SEQ ID NO: 2062 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1216 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: StrR-63 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2062 

AAGCTCAAAA ACATCTGTGG GATATCTACA ACTCCGATCA ATACGTCTCT 50 

TACCCTGACG ATGATTTGCA AGTCGCATCT ACGGTCGTAG ATGTTTCAAA 100 

TGGTAAAGTC ATCGCACAAC TTGGTGCTCG TCATCAAGCA AGTAATGTTT 150 

CATTCGGTAC CAACCAGGCC GTAGAAACCA ATCGTGACTG GGGATCATCA 200 

ATGAAACCAA TCACTGACTA TGCTCCCGCT TTAGAATATG GAGTCTATGA 250 

CTCTACTGCT TCTATTGTAC ATGATGTCCC TTATAACTAT CCTGGCACTG 300 

ATACTCCACT CTACAACTGG GATCATGTCT ACTTTGGAAA CATTACAATC 350 

CAGTATGCTC TTCAACAATC ACGAAATGTC ACAGCCGTTG AGACTTTGAA 4 00 

TAAGGTCGGT CTAGATAGAG CTAAAACCTT CCTTAATGGT CTTGGTATCG 450 

ACTATCCAAG CATGCATTAT GCAAACGCCA TTTCAAGTAA CACAACTGAA 500 

TCCAACAAAA AATATGGTGC AAGTAGTGAA AAAATGGCTG CTGCCTACGC 550 

TGCTTTTGCT AATGGTGGTA TTTATCACAA AC CAATGTAT ATCAATAAAA 600 

TCGTCTTTAG TGATGGTAGC GAAAAAGAAT TTTCTGATGC TGGTACACGA 650 

GCTATGAAAG AGACTACTGC CTATATGATG ACTGAAATGA TGAAAACTGT 700 

TTTAACTTAC GGAACAGGAC GTGGAGCCTA CCTACCATGG CTTCCACAAG 750 
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CAGGTAAGAC AGGTACTTCT AACTATACTG 
ATCAAGAACA CTGGTTACGT AGCTCCAGAT 
CCGTAAATAT GCAATGGCTG TTTGGACAGG 
CAATCATCGG AGATGGTTTC CTTGTTGCTG 
ATAACTTACC TTTCTGAAGA TGACCAACCT 
TGGCTTGTAT AGAAATGGAG AATTCGTATT 
CGTGGAGCTC ACCTGCTCCA CAACAACCCC 
TCATCATCAG ATAGTTCAAC TTCACAGTCT 
AAATAATAGT ACGACTACCA ATCCTAACAA 
CAACCCCTGA TCAACA 



PCT/CA00/01150 

ACGAAGAAAT TGAAAAGTAT 800 

GAAATGTTTG TAGGGTATAC 850 

ATACTCAAAT CGTCTAACTC 900 

GTAAAGTCTA TCGTTCAATG 950 

GGAGATTGGA CAATGCCAGA 1000 

TAAAAATGGT GCTCGTTCTA 1050 

CATCAACTGA AAGTTCAAGC 1100 

AGCTCAACCA CTCCAAGCAC 1150 

TAATACGCAA CAATCAAATA 1200 

1216 



2) INFORMATION FOR SEQ ID NO: 2063 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 810 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: ATCC 700673 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2063 - 

AATACGTCTC TTACCCTGAC GATGATTTGC AAGTCGCATC CACGGTCGTA 50 

GATGTTTCAA ATGGTAAAGT CATCGCCCAA CTTGGAGCTC GTCACCAAGC 100 

AAGTAACGTT TCATTTGGTA CCAACCAAGC TGTGGAAACC AATCGTGACT 150 

GGGGTTCAAC AATGAAACCC ATCACCGATT ATGCACCTGC CATAGAATAC 2 00 

GGTGTATATG ATTCCACTGC AACTATGGTT AATGATATTC CTTATAACTA 250 

TCCGGGAACA AGCACACCTG TCTACAACTG GGATCGAGCA TATTTTGGTA 3 00 

ATATTACTCT GCAATATGCC CTTCAACAAT CTCGTAACGT ACCCGCCGTT 350 

GAGACACTAA ACAAGGTTGG TTTAGATAGA GCTAAAAGTT TCCTAAATGG 4 00 

TTTAGGAATC GACTATCCTG TAATGCACTA TTCAAATGCT ATTTCAAGTA 450 

ATACTACCGA ATCTAGTAAA CAGTACGGGG CAAGTAGTGA AAAAATGGCC 500 

ACTGCCTATG CCGCATTCGC AAACGGCGGT ATTTACCACA AACCAATGTA 550 

CATCAATAAG GTTGTCTTTA GCGATGGTAG CGAAAAAGAA TTTTCTGACC 600 

CTGGCACAAG AGCCATGAAA GAAACGACTG CTTACATGAT GACAGAGATG 650 

ATGAAAACAG TCTGGACTTA CGGAACTGGT CGCGGTGCCT ACCTACCTTG 700 

GCTTCCACAA GCAGGTAAAA CAGGTACTTC TAACTATACT GACGAAGAAA 750 

TTGAAAAGTA TATCAAGAAC ACTGGTTACG TAGCTCCAGA TGAAATGTTT 800 

GTAGGGTATA 810 



2) INFORMATION FOR SEQ ID NO: 2064 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 782 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: ATCC 700678 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2064 

TCTTACCCTG ACGATGATTT GCAAGTCGCA TCTACGGTCG TAGATGTTTC 50 

AAATGGTAAA GTCATCGCCC AACTTGGAGC TCGTCACCAA GCAAGTAACG 100 

TTTCATTTGG TACCAACCAA GCTGTGGAAA CCAATCGTGA CTGGGGATCA 150 

ACTATGAAAC CAATCACAGA CTATGCTCCT GCCTTGGAGT ACGGTGTCTA 200 

CGATTCAACT GCTACTATCG TTCACGATGA GCCCTATAAC TACCCTGGGA 250 

CAGATATCCC TCTCTATAAC TGGGATCGAG CATATTTCGG TAATATTACT '3 00 

CTGCAATATG CCCTTCAACA ATCTCGTAAC GTACCTGCCG TTGAAACACT 350 

AAACAAGGTC GGTCTAGATA AGGCTAAAAC CTTCCTTAAT GGTCTTGGTA 400 

TCGACTATCC AAGCATGCAT TATGCAAACG CCATTTCAAG TAATACAACT 450 

GAATCCAACA AAAAATATGG TGCAAGTAGT GAAAAAATAG CTACCGCCTA 500 

TGCCGCATTC GCAAATGGTG GTATTTACCA CAAACCAATG TACATCAATA 550 

AAGTTGTCTT TAGCGATGGT AGCGAAAAAG AATTTTCTGA CCCTGGCACA 600 

AGAGCCATGA AAGAAACGAC TGCTTACATG ATGACAGAAA TGATGAAAAC 650 

AGTCTGGACG TACGGAACTG GTCGTGGTGC CTACCTGCCT TGGCTTCCTC 700 

AAGCTGGTAA AACAGGTACC TCTAACTATA CTGACGAAGA AATTGAAAAG 750 

TATATCAAGA ACACTGGTTA CGTAGCTCCA GA 782 



2) INFORMATION FOR SEQ ID NO: 2065 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2065 
CCAGGACGTG GAGGCGATCA CA 22 



2) INFORMATION FOR SEQ ID NO: 2066 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2066 
CACCGACAGC GAGCCGATCA GA 22 
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2) INFORMATION FOR SEQ ID NO: 2067 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2067 
- AGCTGAGCCA ATTCATGG 18 



2) INFORMATION FOR SEQ ID NO: 2068 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

■i 4 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:. 1068 
ATTCATGGAC CAGAACAAC 19 



2) INFORMATION FOR SEQ ID NO: 2069 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2069 
CGCTGTCGGG GTTGACCC 18 



2) INFORMATION FOR SEQ ID NO: 2070 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

1073 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 

(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2070 
GTTGACCCAC AAGCGCCG 



2) INFORMATION FOR SEQ ID NO: 2071 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2071 
CGACTGTCGG CGCTGGGG 18 



2) INFORMATION FOR SEQ ID NO: 2072 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3534 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: Rv 

(C) ACCESSION NUMBER: L27989 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2072 

GTGCTGGAAG GATGCATCTT GGCAGATTCC CGCCAGAGCA AAACAGCCGC 
TAGTCCTAGT CCGAGTCGCC CGCAAAGTTC CTCGAATAAC TCCGTACCCG 
GAGCGCCAAA CCGGGTCTCC TTCGCTAAGC TGCGCGAACC ACTTGAGGTT 
CCGGGACTCC TTGACGTCCA GACCGATTCG TTCGAGTGGC TGATCGGTTC 
GCCGCGCTGG CGCGAATCCG CCGCCGAGCG GGGTGATGTC AACCCAGTGG 
GTGGCCTGGA AGAGGTGCTC TACGAGCTGT CTCCGATCGA GGACTTCTCC 
GGGTCGATGT CGTTGTCGTT CTCTGACCCT CGTTTCGACG ATGTCAAGGC 
ACCCGTCGAC GAGTGCAAAG ACAAGGACAT GACGTACGCG GCTCCACTGT 
TCGTCACCGC CGAGTTCATC AACAACAACA CCGGTGAGAT CAAGAGTCAG 
ACGGTGTTCA TGGGTGACTT CCCGATGATG ACCGAGAAGG GCACGTTCAT 
CATCAACGGG ACCGAGCGTG TGGTGGTCAG CCAGCTGGTG CGGTCGCCCG 
GGGTGTACTT CGACGAGACC ATTGACAAGT CCACCGACAA GACGCTGCAC 
AGCGTCAAGG TGATCCCGAG CCGCGGCGCG TGGCTCGAGT TTGACGTCGA 
CAAGCGCGAC ACCGTCGGCG TGCGCATCGA CCGCAAACGC CGGCAACCGG 
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TCACCGTGCT GCTCAAGGCG CTGGGCTGGA CCAGCGAGCA GATTGTCGAG 750 

CGGTTCGGGT TCTCCGAGAT CATGCGATCG ACGCTGGAGA AGGACAACAC 800 

CGTCGGCACC GACGAGGCGC TGTTGGACAT CTACCGCAAG CTGCGTCCGG 850 

GCGAGCCCCC GACCAAAGAG TCAGCGCAGA CGCTGTTGGA AAACTTGTTC 900 

TTCAAGGAGA AGCGCTACGA CCTGGCCCGC GTCGGTCGCT ATAAGGTCAA 950 

CAAGAAGCTC GGGCTGCATG TCGGCGAGCC CATCACGTCG TCGACGCTGA 1000 

CCGAAGAAGA CGTCGTGGCC ACCATCGAAT ATCTGGTCCG CTTGCACGAG 1050 

GGTCAGACCA CGATGACCGT TCCGGGCGGC GTCGAGGTGC CGGTGGAAAC 1100 

CGACGACATC GACCACTTCG GCAACCGCCG CCTGCGTACG GTCGGCGAGC 1150 

TGATCCAAAA CCAGATCCGG GTCGGCATGT CGCGGATGGA GCGGGTGGTC 1200 

CGGGAGCGGA TGACCACCCA GGACGTGGAG GCGATCACAC CGCAGACGTT 1250 

GATCAACATC CGGCCGGTGG TCGCCGCGAT CAAGGAGTTC TTCGGCACCA 1300 

GCCAGCTGAG CCAATTCATG GACCAGAACA ACCCGCTGTC GGGGTTGACC 1350 

CACAAGCGCC GACTGTCGGC GCTGGGGCCC GGCGGTCTGT CACGTGAGCG 1400 

TGCCGGGCTG GAGGTCCGCG ACGTGCACCC GTCGCACTAC GGCCGGATGT 1450 

GCCCGATCGA AACCCCTGAG GGGCCCAACA TCGGTCTGAT CGGCTCGCTG 1500 

TCGGTGTACG CGCGGGTCAA CCCGTTCGGG TTCATCGAAA CGCCGTACCG 1550 

CAAGGTGGTC GACGGCGTGG TTAGCGACGA GATCGTGTAC CTGACCGCCG 1600 

ACGAGGAGGA CCGCCACGTG GTGGCACAGG CCAATTCGCC GATCGATGCG 1650 

GACGGTCGCT TCGTCGAGCC GCGCGTGCTG GTCCGCCGCA AGGCGGGCGA 1700 

GGTGGAGTAC GTGCCCTCGT CTGAGGTGGA CTACATGGAC GTCTCGCCCC 1750 

GCCAGATGGT GTCGGTGGCC ACCGCGATGA TTCCCTTCCT GGAGCACGAC 1800 

GACGCCAACC GTGCCCTCAT GGGGGCAAAC ATGCAGCGCC AGGCGGTGCC 1850 

GCTGGTCCGT AGCGAGGCCC CGCTGGTGGG CACCGGGATG GAGCTGCGCG 1900 

CGGCGATCGA CGCGGCGACG TCGTCGTCGC AAGAAAGCGG CGTCATCGAG 1950 

GAGGTGTCGG CCGACTACAT CACTGTGATG CACGACAACG GCACCCGGCG 2000 

TACCTACCGG ATGCGCAAGT TTGCCCGGTC CAACCACGGC ACTTGCGCCA 2050 

ACCAGTGCCC CATCGTGGAC GCGGGCGACC GAGTCGAGGC CGGTCAGGTG 2100 

ATCGCCGACG GTCCCTGTAC TGACGACGGC GAGATGGCGC TGGGCAAGAA 2150 

CCTGCTGGTG GCCATCATGC CGTGGGAGGG CCACAACTAC GAGGACGCGA 2200 

TCATCCTGTC CAACCGCCTG GTCGAAGAGG ACGTGCTCAC CTCGATCCAC 2250 

ATCGAGGAGC ATGAGATCGA TGCTCGCGAC ACCAAGCTGG GTGCGGAGGA 23 00 

GATCACCCGC GACATCCCGA ACATCTCCGA CGAGGTGCTC GCCGACCTGG 2350 

ATGAGCGGGG CATCGTGCGC ATCGGTGCCG AGGTTCGCGA CGGGGACATC 2400 

CTGGTCGGCA AGGTCACCCC GAAGGGTGAG ACCGAGCTGA CGCCGGAGGA 2450 

GCGGCTGCTG CGTGCCATCT TCGGTGAGAA GGCCCGCGAG GTGCGCGACA 2500 

CTTCGCTGAA GGTGCCGCAC GGCGAATCCG GCAAGGTGAT CGGCATTCGG 2550 

GTGTTTTCCC GCGAGGACGA GGACGAGTTG CCGGCCGGTG TCAACGAGCT 2600 

GGTGCGTGTG TATGTGGCTC AGAAACGCAA GATCTCCGAC GGTGACAAGC 2650 

TGGCCGGCCG GCACGGCAAC AAGGGCGTGA TCGGCAAGAT CCTGCCGGTT 2700 

GAGGACATGC CGTTCCTTGC CGACGGCACC CCGGTGGACA TTATTTTGAA 2750 

CACCCACGGC GTGCCGCGAC GGATGAACAT CGGCCAGATT TTGGAGACCC 2800 

ACCTGGGTTG GTGTGCCCAC AGCGGCTGGA AGGTCGACGC CGCCAAGGGG 2850 

GTTCCGGACT GGGCCGCCAG GCTGCCCGAC GAACTGCTCG AGGCGCATGC 2900 

GAACGCCATT GTGTCGACGC CGGTGTTCGA CGGCGCCCAG GAGGCCGAGC 2950 

TGCAGGGCCT GTTGTCGTGC ACGCTGCCCA ACCGCGACGG TGACGTGCTG 3000 

GTCGACGCCG ACGGCAAGGC CATGCTCTTC GACGGGCGCA GCGGCGAGCC 3050 

GTTCCCGTAC CCGGTCACGG TTGGCTACAT GTACATCATG AAGCTGCACC 3100 

ACCTGGTGGA CGACAAGATC CACGCCCGCT CCACCGGGCC GTACTCGATG 3150 

ATCACCCAGC AGCCGCTGGG CGGTAAGGCG CAGTTCGGTG GCCAGCGGTT 3200 

CGGGGAGATG GAGTGCTGGG CCATGCAGGC CTACGGTGCT GCCTACACCC 3250 

TGCAGGAGCT GTTGACCATC AAGTCCGATG ACACCGTCGG CCGCGTCAAG 33 00 

GTGTACGAGG CGATCGTCAA GGGTGAGAAC ATCCCGGAGC CGGGCATCCC 33 50 

CGAGTCGTTC AAGGTGCTGC TCAAAGAACT GCAGTCGCTG TGCCTCAACG 3400 

TCGAGGTGCT AT CG AGTGAC GGTGCGGCGA TCGAACTGCG CGAAGGTGAG 3450 

GACGAGGACC TGGAGCGGGC CGCGGCCAAC CTGGGAATCA ATCTGTCCCG 3500 

CAACGAATCC GCAAGTTTCG AGGATCTTGC GTAA 3534 
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2) INFORMATION FOR SEQ ID NO: 2073 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2073 
CCGAGCAACA TGATTGAACC ATCCACCAAC TGGCTCGG 38 



2) INFORMATION FOR SEQ ID NO: 2074 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2074 
CCGAGCAACA TGATTGAAGC TTCCACCAAC TGGCTCGG 38 



2) INFORMATION FOR SEQ ID NO: 2075 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2075 
CCGAGCCAGG TTCTGAAGTC TCTGCATTAT TAGGTGCTCG G 41 



2) INFORMATION FOR SEQ ID NO: 2076 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear V - 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2076 
CCGAGCYGAY AACATTTTCA GATTCACCCA RGCGCTCGG 39 



2) INFORMATION FOR SEQ ID NO: 2077 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2077 
CCGAGCAACC GATCCAGCTC CAGCTACGCT CGG 33 



2) INFORMATION FOR SEQ ID NO: 2078 

(i) SEQUENCE CHARACTERISTICS: ' 

(A) LENGTH: 34 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2078 
CCGAGCCTTG GTCTTCGGCC AAATGAACGC TCGG 34 



2) INFORMATION FOR SEQ ID NO: 2079 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2079 
CCGAGCGTTC AGTTACTTCA GTCCAAGCCG GCTCGG 36 
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2) INFORMATION FOR SEQ ID NO: 2080 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2080 
CCGAGCCGAA GAGGGCCAAG ATGTCGCTCG G 31 



2) INFORMATION FOR SEQ ID NO: 2081 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2081 
GRATYRTYAA AGTTGGTGAG GAAG 24 



2) INFORMATION FOR SEQ ID NO: 2082 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2082 
CMACTTCATC YCGCTTCGTA CC 22 



2) INFORMATION FOR SEQ ID NO: 2083 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 4 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2083 
CCGCCGATGT TCCGTAAATT ACTTGAIGAA GGTCGAGCCG GCGG 44 



2) INFORMATION FOR SEQ ID NO: 2084 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2084 
CACGCGTCAA CACCCGTACA AGTCGTCTTT TGCGCGTG 38 



2) INFORMATION FOR SEQ ID NO: 2085 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 bases 
.(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2085 
CAAACTAAAG AACATATCTT GCTA 24 



2) INFORMATION FOR SEQ ID NO: 2086 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2086 
ATATAATTTG CATCACCTTC AAG 23 
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2) INFORMATION FOR SEQ ID NO: 2087 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2087 
TCAGCTCGTG GGATTAGGAG AG 



2) INFORMATION FOR SEQ ID NO: 2088 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2088 
AGGCTTCACG CTGTTAGGCT GA 



2) INFORMATION FOR SEQ ID NO: 2089 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear . 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2089 
ATGCTGAACT TATTGACCTT 



2) INFORMATION FOR SEQ ID NO: 2090 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO*: 2090 
CGTTACTGGA GTCGAAATG 



2) INFORMATION FOR SEQ ID NO: 2091 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2091 
CGCGACTTGA GATGGAACTT AGTGAGCTTC TTGGTCGCG 



2) INFORMATION FOR SEQ ID NO: 2092 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2092 
CGCGACGAAA GAACTTCCTG AAGGTCGTGC AGGTCCAG 



2) INFORMATION FOR SEQ ID NO: 2093 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2093 
TGTTGGCAAT CGAAGACACC 
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2) INFORMATION FOR SEQ ID NO: 2094 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2094 

TTCAATTTCT TGACCTACTT TCAA 24 
2) INFORMATION FOR SEQ ID NO: 2095 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2095 
CGGTCGGGTT GAACGTGG 18 



2) INFORMATION FOR SEQ ID NO: 2096 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2096 
CGCGACCGGT ACCACGGCCA GTAATCGTGT CGCG 34 
2) INFORMATION FOR SEQ ID NO: 2097 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1185 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Mycoplasma pneumoniae 

(B) STRAIN: ATCC 29342 

(C) ACCESSION NUMBER: AE000019 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2097 

ATGGCAAGAG AGAAATTTGA CCGATCTAAA CCCCACGTTA ATGTAGGTAC 50 

TATTGGCCAC ATTGACCACG GTAAAACAAC TTTAACAGCA GCTATTTGTA 100 

CTGTATTAGC AAAAGAAGGT AAATCAGCTG CTACTCGTTA CGACCAAATC 150 

GATAAGGCTC CGGAAGAAAA AGCACGGGGA ATTACGATTA ACTCCGCTCA 200 

CGTGGAGTAC TCCTCTGACA AGCGTCACTA TGCTCACGTT GACTGTCCAG 250 

GACACGCTGA CTACATTAAG AACATGATTA CTGGTGCTGC ACAAATGGAT 3 00 

GGTGCCATTC TAGTAGTTTC AGCAACTGAC AGTGTTATGC CCCAAACCCG 350 

TGAACACATT TTGTTGGCCC GCCAAGTGGG TGTGCCACGC ATGGTAGTGT 400 

TCCTAAACAA GTGTGACATT GCAACTGATG AAGAAGTGCA AGAGTTAGTA 450 

GCAGAAGAGG TACGTGACTT ATTAACTTCT TACGGCTTTG ATGGCAAGAA 500 

CACCCCTATT ATTTATGGTT CTGCACTTAA AGCGCTTGAA GGTGATCCTA 550 

AGTGGGAAGC TAAGATCCAT GATTTAATGA ATGCAGTTGA TGAATGGATT 6 00 

CCAACTCCTG AACGTGAAGT GGACAAACCC TTCTTGTTGG CAATCGAAGA 650 

CACCATGACG ATTACTGGCC GTGGTACCGT GGTTACCGGT CGGGTTGAAC 700 

GTGGTGAATT GAAAGTAGGT CAAGAAATTG AAATCGTTGG TTTACGTCCA 750 

ATCCGTAAAG CAGTTGTTAC CGGAATCGAA ATGTTCAAAA AGGAACTTGA 800 

TTCAGCAATG GCTGGGGACA ACGCTGGGGT ATTACTCCGT GGTGTGGACC 850 

GTAAAGAAGT GGAACGTGGT CAAGTGTTAG CTAAACCAGG TTCGATTAAA 900 

CCGCACAAGA AATTTAAAGC GGAAATCTAT GCTTTAAAGA AGGAAGAAGG 950 

TGGTCGTCAC ACCGGTTTCT TAAACGGTTA CCGTCCCCAA TTCTACTTCC 1000 
GTACTACAGA CGTTACTGGT TCGATTTCCC TACCAGAAAA CACCGAAATG ■ 1050 

GTGCTACCAG GTGACAATAC CT CGATTAC A GTTGAACTAA TTGCACCAAT 1100 

TGCTTGTGAA AAAGGTAGTA AGTTCTCCAT CCGTGAAGGT GGTCGAACGG 1150 

TTGGTGCTGG TTCAGTCACG GAAGTGCTTG AATAG 1185 



2) INFORMATION FOR SEQ ID NO: 2098 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2098 
CTGAGTCACA CCGACAAACG TC 



2) INFORMATION FOR SEQ ID. NO: 2099 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2099 
CCAGGACTGA ACGGGATACG AA 22 



2) INFORMATION FOR SEQ ID NO: 2100 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2100 
GCGAGACGAT AGGTTGTC 18 



2) INFORMATION FOR SEQ ID NO: 2101 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2609 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: H37Rv 

(C) ACCESSION NUMBER: Z79701 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2101 

CAGCCCGCGA GCGTAACCTG GCTGCGATTT CCGGCGCGGA TTTTCGCAGT 50 

GCGGTTACGC TCGGAAAGCG CGGGCCTCGC CCACGCGGCG GATGATGTCA 100 

GCGGGGTGGT CCTCGGCGAC GACCCGGACC ACGATCCACC CGTAGCGGTG 150 

CTGGACTTTC TCGTGCCGGA GGATGTCTTT CCGGTAGTGG TAGCGACTGG 200 

TCAGATGGTG GTCGCCGTCA TACTCGGCCG CGACCTTGAT GTCTTGCCAG 250 

CCCATATCCA AATGGGCTTC CGCCCAGCCC CATTCGTTGC GCACCGCGAT 3 00 

CTGCGTCTGG GGGCGCGGAA AGCCGGCGCG GATCAACAAC AAGCGCAGCC 350 

AGGTTTCCTT GGGGGACTGG GCACCGCCGT CGACGAGGTC CAGAGCGGCT 400 

CTTGCGGCCT TCATGCCACG GCGGCCCCGA TAGCGCTCGA TCAGCGGCTC 450 

GACGTCGGCC ACCTTCAAAT CGGTGGCCTG TATCAGGGCG TCGACGGCCG 500 

CGACGGCGGG GTCCAATGGA AATCGACTGG TCAGGTCGAG CGCCGTTCGC 550 

TCCGGTGTGG TCACGCGCAT GCCCTCGATG ACGCAGATCT CGTCGGGCTC 600 

GATGCGCTCT TCCCAGACTT GCAGCCCCGG GGCACGGCGG CGGTTGGTGT 650 

CGATGATCGC GGCGGGAAGA TCCGCGTCGA TCCACTTGGC GCCATGGAAG 700 

GCAGAAGCCG AGTAGCCGGC CAGCACGCCG CGGCGGCGCG AGCGCAGCCA 750 

CAGCGCTTTT GCACGCAATT GCGCGGTCAG TTCCACACCC TGCGGCACGT 800 
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ACACGTCTTT ATGTAGCGCG ACATACCTGC TGCGCAATTC GTAGGGCGTC 850 
AATACACCCG CAGCCAGGGC CTCGCTGCCC AGAAAGGGAT CCGTCATGGT 900 
CGAAGTGTGC TGAGT CACAC CGACAAACGT CACGAGCGTA ACCCCAGTGC 950 

GAAAGTTCCC GCCGGAAATC GCAGCCACGT TACGCTCGTG GACATACCGA 1000 

TTTCGGCCCG GCCGCGGCGA GACGATAGGT TGTCGGGGTG ACTGCCACAG 1050 

CCACTGAAGG GGCCAAACCC CCATTCGTAT CCCGTTCAGT CCTGGTTACC 1100 

GGAGGAAACC GGGGGATCGG GCTGGCGATC GCACAGCGGC TGGCTGCCGA 1150 

CGGCCACAAG GTGGCCGTCA CCCACCGTGG ATCCGGAGCG CCAAAGGGGC 1200 

TGTTTGGCGT CGAATGTGAC GTCACCGACA GCGACGCCGT CGATCGCGCC 1250 

TTCACGGCGG TAGAAGAGCA CCAGGGTCCG GTCGAGGTGC TGGTGTCCAA 13 00 

CGCCGGCCTA TCCGCGGACG CATTCCTCAT GCGGATGACC GAGGAAAAGT 1350 

TCGAGAAGGT CATCAACGCC AACCTCACCG GGGCGTTCCG GGTGGCTCAA 1400 

CGGGCATCGC GCAGCATGCA GCGCAACAAA TTCGGTCGAA TGATATTCAT 1450 

AGGTTCGGTC TCCGGCAGCT GGGGCATCGG CAACCAGGCC AACTACGCAG 1500 

CCTCCAAGGC CGGAGTGATT GGCATGGCCC GCTCGATCGC CCGCGAGCTG 1550 

TCGAAGGCAA ACGTGACCGC GAATGTGGTG GCCCCGGGCT ACATCGACAC 1600 

CGATATGACC CGCGCGCTGG ATGAGCGGAT TCAGCAGGGG GCGCTGCAAT 1650 

TTATCCCAGC GAAGCGGGTC GGCACCCCCG CCGAGGTCGC CGGGGTGGTC 1700 

AGCTTCCTGG CTTCCGAGGA TGCGAGCTAT ATCTCCGGTG CGGTCATCCC 1750 

GGTCGACGGC GGCATGGGTA TGGGCCACTG ACACAACACA AGGACGCACA 1800 

TGACAGGACT GCTGGACGGC AAACGGATTC TGGTTAGCGG AATCATCACC 1850 

GACTCGTCGA TCGCGTTTCA CATCGCACGG GTAGCCCAGG AGCAGGGCGC 1900 

CCAGCTGGTG CTCACCGGGT TCGACCGGCT GCGGCTGATT CAGCGCATCA 1950 

CCGACCGGCT GCCGGCAAAG GCCCCGCTGC TCGAACTCGA CGTGCAAAAC 2000 

GAGGAGCACC TGGCCAGCTT GGCCGGCCGG GTGACCGAGG CGATCGGGGC 2050 

GGGCAACAAG CTCGACGGGG TGGTGCATTC GATTGGGTTC ATGCCGCAGA 2100 

CCGGGATGGG CATCAACCCG TTCTTCGACG CGCCCTACGC GGATGTGTCC 2150 

AAGGGCATCC ACATCTCGGC GTATTCGTAT GGTTGGATGG CCAAGGCGCT 2200 

GCTGCCGATC ATGAACCCCG GAGGTTCCAT CGTCGGCATG GACTTCGACC 2250 

CGAGCCGGGC GATGCCGGCC TACAACTGGA TGACGGTCGC CAAGAGCGCG 23 00 

TTGGAGTCGG TCAACAGGTT CGTGGCGCGC GAGGCCGGCA AGTACGGTGT 2350 

GCGTTCGAAT CTCGTTGCCG CAGGCCCTAT CCGGACGCTG GCGATGAGTG 24 00 

CGATCGTCGG CGGTGCGCTC GGCGAGGAGG CCGGCGCCCA GATCCAGCTG 2450 

CTCGAGGAGG GCTGGGATCA GCGCGCTCCG ATCGGCTGGA ACATGAAGGA 2500 

TGCGACGCCG GTCGCCAAGA CGGTGTGCGC GCTGCTGTCT GACTGGCTGC 2550 

CGGCGACCAC GGGTGACATC ATCTACGCCG ACGGCGGCGC GCACACCCAA 2600 

TTGCTCTAG 2609 



2) INFORMATION FOR SEQ ID NO: 2102 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2102 
CACCTTCACC CTGACCGACG 



2) INFORMATION FOR SEQ ID NO: 2103 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2103 
CGAACCAGCG GAAATAGTTG GAC 23 



2) INFORMATION FOR SEQ ID NO: 2104 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2104 
CTGGGCATGG CICGAGTC .18 



2) INFORMATION FOR SEQ ID NO: 2105 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 97 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium tuberculosis 

(B) STRAIN: H37rv 

(C) ACCESSION NUMBER: U68480 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2105 

ATGACACAGT GCGCGAGCAG ACGCAAAAGC ACCCCAAATC GGGCGATTTT 50 

GGGGGCTTTT GCGTCTGCTC GCGGGACGCG CTGGGTGGCC ACCATCGCCG 100 

GGCTGATTGG CTTTGTGTTG TCGGTGGCGA CGCCGCTGCT GCCCGTCGTG 150 

CAGACCACCG CGATGCTCGA CTGGCCACAG CGGGGGCAAC TGGGCAGCGT 200 

GACCGCCCCG CTGATCTCGC TGACGCCGGT CGACTTTACC GCCACCGTGC 250 

CGTGCGACGT GGTGCGCGCC ATGCCACCCG CGGGCGGGGT GGTGCTGGGC 300 

ACCGCACCCA AGCAAGGCAA GGACGCCAAT TTGCAGGCGT TGTTCGTCGT 350 

CGTCAGCGCC CAGCGCGTGG ACGTCACCGA CCGCAACGTG GTGATCTTGT 400 

CCGTGCCGCG CGAGCAGGTG ACGTCCCCGC AGTGTCAACG CATCGAGGTC 450 
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ACCTCTACCC ACGCCGGCAC 
GTCGGGCGCG CCGCTGCGCA 
AGATTGTCGG GGTGTTCACC 
GCGGTCTCGG CGACCATCGA 
GAAACTGCTG GCGATCATCG 
TCGCGTTGTG GCGCCTGGAC 
CTCCTCCTCA GGCCGTTCCG 
GCTGATTCCG GCAAGCTGGC 
TATTCGGCTT CCTGCTCTGG 
GGCTACATCC TGGGCATGGC 
CAACTATTTC CGCTGGTTCG 
ACAACCTGCT GGCGCTGATG 
CGCCTGCCAG ACCTGGCCGC 
TGAGGTGCTG CCCCGCCTCG 
ACTGGGCGGC GGCCATGGTC 
GGCCTGCGGC CGGAGGGCAT 
GCTGATCGAG CGGTCCATGC 
CCGTCGTTAC CGCCGCATTC 
GCGGTGGCCG GGCTGGTGGC 
GCGCCGYCAT CGCCTGGTCG 
CCGCCGGCAC CGTCATCCTG 
ACGGTGTTGG AAGCCACCAG 
GTGGTATACC GAGAACCTGC 
ACGGTTCGCT GTCGCGGCGC 
TTCACCGCGG TGTTCATCAT 
CCGCGGACCG GCGTGGCGGC 
TCCTGATGTT CACGCCCACC 
GCCGTAGGGG CGGCGATGGC 
GGTGCTGCGC TGGTCGCGCA 
TCCTGCTGGC GTTGTGTTGG 
AGCTACGGTG TGCCGTTCAA 
AGTCAGCACA ATCTTTTTCG 
CCTGGCTGCA CTTCGCGCCC 
GCGCTGACGA CAGCCCCGGT 
GTTCGTCGCG TCCATGGTGG 
CCAACGGCTG GTCCAACGTG 
GACGACGTAC TCGTCGAGCC 
GGACGGCGAT TCGGGTTCTT 
ACCCGGTCGG CTTCACGCCC 
GCGATCGTGA TGAAACCCAA 
GCCGACCAAG CTGACGAGTC 
CCTATGGGCT CGATCCCGCC 
GGCGCACAGC AACAGAGCAC 
GCCGGACGAC GGGCATCCGC 
CCGGCAACAG CGTGCTGCAC 
GAATACGCCA TGCCGGGACC 
GCCCGACGAC CTATACGGAG 
TCGCCCGAGC AAAGATGCCC 
GAGGATCTGT CGCTGACACC 
GGTACCGGAC CTGCGCTCAC 
TGCTGCTGGA CTGGGCGGTC 
CTGCACGCCA ATGGCATCGC 
CTACTCGGCT AAGAAGCTGG 
GCGGCCTGCT CGGGATCACC 
ACCTACCTGT CCCGCGACTG 
CGACACCCTG GTCGATGCCC 
CCCGCAGCGG CCTGTGGTCA 



CTTCGCCAAC TTCGTCGGGC 
GCGGCTTCCC CGACCCCAAC 
GACCTGACCG GGCCCGCGCC 
CACCCGGTTC TCCACCCGGC 
GGGCGATCGT GGCCACCGTC 
CAGTTGGACG GGCGGGGCTC 
GCCTGCATCG TCGCCGGGCG 
GCACCTTCAC CCTGACCGAC 
CATGTCATCG GCGCGAATTC 
CCGAGTCGCC GACCACGCCG 
GCAGCCCGGA GGATCCCTTC 
ACCCATGTCA GCGACGCCAG 
CGGGCTAGTG TGCTGGCTGC 
GGCCGGCGGT GGAGGCCAGC 
TTGCTGACCG CGTGGATGCC 
CATCGCGCTC GGCTCGCTGG 
GGTACAGCCG GCTCACACCG 
ACACTGGGTG TGCAGCCCAC 
CGGCGGCCGC CCGATGCTGC 
GCACGTTGCC GTTGGTGTCG 
ACCGTGGTGT TCGCCGACCA 
GGTTCGCGCC AAAATCGGGC 
GTTACTACTA CCTCATCCTG 
TTCGGCTTTT TGATCACCGC 
GTTGCGGCGC AAGCGAATTC 
TGATGGGCGT CATCTTCGGC 
AAGTGGGTGC ACCACTTCGG 
CGCGCTGACG ACGGTGTTGG 
ACCGGATGGC GTTCCTGGCG 
GCCACCACCA ACGGCTGGTG 
CAGCGCGATG CCGAAGATCG 
CCCTGTTTGC GATCGCCGCC 
CGCGGCGCCG GCGAAGGGCG 
ACCGATCGTG GCCGGTTTCA 
CCGGGATCGT GCGACAGTAC 
CGGGCGTTTG TCGGCGGCTG 
TGATACCAAT GCGGGTTTCA 
GGGGCCCCTT GGGCCCGCTG 
AACGGCGTAC CGGAACACAC 
CCAGCCCGGC ACCGACTACG 
CTGGCATCAA TGGTTCTACG 
CGGGTACCGT TGGCAGGCAC 
ACTCGTCTCG GCGTGGTATC 
TGGTCGTGGT GACCGCCGCG 
GGGTACACCC CCGGGCAGAC 
CGGAGCGCTG GTACCCGCCG 
AGCAGCCCAA GGCGTGGCGC 
GCCGATGCCG TCGCGGTCCG 
GGAGGACTGG ATCGCGGTGA 
TGCAGGAATA TGTGGGCTCG 
GGTTTGGCCT TCCCGTGCCA 
CGAAATCCCG AAGTTCCGCA 
ACACCGACAC GTGGGAAGAC 
GACCTGTTGC TGCGGGCCCA 
GGCCCGCGAT TGGGGTTCCC 
CTCCCGCCCA GCTCGAGTTG 
CCGGGCAAGA TCCGAATTGG 
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2) INFORMATION FOR SEQ ID NO: 2106 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2106 

GCGAGCAGAG CACGCCCTCC TCGCCGCTCG C 31 
2) INFORMATION FOR SEQ ID NO: 2107 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 32 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2107 
GCGAGCTCCC CATCTCTGGT TGGCACGCTC GC 32 



2) INFORMATION FOR SEQ ID NO: 2108 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2108 
GCGGGCAACT TCRTCAAGAA GGTTGGTTAC AACCCGCCCG C 41 



2) INFORMATION FOR SEQ ID NO: 2109 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2109 
GCGGGCCCAA TCTCTGGTTG GAAYGGTGAC AAGCCCGC 



2) INFORMATION FOR SEQ ID NO: 2110 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2110 
GCGGGCCCTT AACGATTTCA GCGAATCTGG ATTCAGCCCG C 



2) INFORMATION FOR SEQ ID NO: 2111 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single " 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2111 
GCGGGCATGA TTGAAGCCAC CACCAACGCT TCCTGGCCCG C 



2) INFORMATION FOR SEQ ID NO: 2112 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 43 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2112 
GCGGGCTTGA TGAAGTTTTG GGTTTCCTTG ACAATTTGCC CGC 



2) INFORMATION FOR SEQ ID NO: 2113 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2113 
GCGGGCACAA GGGTTGGACT AAGGAAACCA AGGCAGCCCG C 41 



2) INFORMATION FOR SEQ ID NO: 2114 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2114 
GCGGGCATCG ATGCTATTGA ACCACCTGTC AGACCGCCCG C 41 
2) INFORMATION FOR SEQ ID NO: 2115 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2115 
GCGGGCTTGA TGATTTCCTC GAATCTAGAT TGGGCCCGC 39 



2) INFORMATION FOR SEQ ID NO: 2116 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2116 

GCGGGCGGTA AGTCCACCGG TAAGACCTTG TTGGCCCGC 39 
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2) INFORMATION FOR SEQ ID NO: 2117 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2117 
GCGGGCGACG CCATTGAGCC ACCTTCGAGA GCCCGC 36 



2) INFORMATION FOR SEQ ID NO: 2118 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2118 
GCGGGCTCCT TGACAATTTC TTCGTATCTG TTCTTGGCCC GC 42 



2) INFORMATION FOR SEQ ID NO: 2119 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2119 
GCGGGCTTAC AACCCTAAGG CTGTTCCATT CGTTGCCCGC 40 



2) INFORMATION FOR SEQ ID NO: 2120 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2120 
GCGGGCAGAA ACCAAGGCTG GTAAGGTTAC CGGTAGCCCG C 41 



2) INFORMATION FOR SEQ ID NO: 2121 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2121 
CGACCGCIAG CCGCACACCA AGTTCCGGTC G 31 
2) INFORMATION FOR SEQ ID NO: 2122 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2122 
CCGAGCGAAT GTAGGAGTCC AGGGTCTCTG CTCGG 35 



2) INFORMATION FOR SEQ ID NO: 2123 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2123 
TCTACCACTG AAGCATTAC 19 
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2) INFORMATION FOR SEQ ID NO: 2124 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2124 
TAGGTACTGT AGGTTTATTG 



2) INFORMATION FOR SEQ ID NO: 2125 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2125 
CACGCGGATT TTGAATCTCT TCCTCTAGTA GCGCGTG 



2) INFORMATION FOR SEQ ID NO: 2126 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2126 
ATATCAGAGA CTGATGAG 



2) INFORMATION FOR SEQ ID NO: 2127 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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( D) TOPOLOGY : ' Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2127 
TAGCATATTC AGAGAATATT GT 



2) INFORMATION FOR SEQ ID NO: 2128 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2128 
CAACGCTGGA GAATCTATAT TTGTAGAAAC TGCGTTG 
2) INFORMATION FOR SEQ ID NO: 2129 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8133 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: VPI 10463 

(C) ACCESSION NUMBER: X51797 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2129 

ATGTCTTTAA TATCTAAAGA AGAGTTAATA AAACTCGCAT ATAGCATTAG 50 

ACCAAGAGAA AATGAGTATA AAACTATACT AACTAATTTA GACGAATATA 100 

ATAAGTTAAC TACAAACAAT AATGAAAATA AATATTTGCA ATTAAAAAAA 150 

CTAAATGAAT CAATTGATGT TTTTATGAAT AAATATAAAA CTTCAAGCAG 200 

AAATAGAGCA CTCTCTAATC TAAAAAAAGA TATATTAAAA GAAGTAATTC 250 

TTATTAAAAA TTCCAATACA AGCCCTGTAG AAAAAAATTT ACATTTTGTA 3 00 

TGGATAGGTG GAGAAGTCAG TGATATTGCT CTTGAATACA TAAAACAATG 3 50 

GGCTGATATT AATGCAGAAT ATAATATTAA ACTGTGGTAT GATAGTGAAG 400 

CATTCTTAGT AAATACACTA AAAAAGGCTA TAGTTGAATC TTCTACCACT 450 

GAAGCATTAC AGCTACTAGA GGAAGAGATT CAAAATCCTC AATTTGATAA 500 

TATGAAATTT TACAAAAAAA GGATGGAATT TATATATGAT AGACAAAAAA 550 

GGTTTATAAA TTATTATAAA TCTCAAATCA ATAAACCTAC AGTACCTACA 600 

ATAGATGATA TTATAAAGTC TCATCTAGTA TCTGAATATA ATAGAGATGA 650 

AACTGTATTA GAATCATATA GAACAAATTC TTTGAGAAAA ATAAATAGTA 700 

ATCATGGGAT AGATATCAGG GCTAATAGTT TGTTTACAGA ACAAGAGTTA 750 

TTAAATATTT ATAGTCAGGA GTTGTTAAAT CGTGGAAATT TAGCTGCAGC 800 
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ATCTGACATA GTAAGATTAT TAGCCCTAAA AAATTTTGGC 
TAGATGTTGA TATGCTTCCA GGTATTCACT CTGATTTATT 
TCTAGACCTA GCTCTATTGG ACTAGACCGT TGGGAAATGA 
GGCTATTATG AAGTATAAAA AATATATAAA TAATTATACA 
TTGATAAACT TGATCAACAA TTAAAAGATA ATTTTAAACT 
AGTAAAAGTG AAAAATCTGA GATATTTTCT AAATTAGAAA 
ATCTGATCTT GAAATTAAAA TAGCTTTCGC TTTAGGCAGT 
AAGCCTTGAT ATCAAAACAA GGTTCATATC TTACTAACCT 
CAAGTAAAAA ATAGATATCA ATTTTTAAAC CAACACCTTA 
AGAGTCTGAT AATAACTTCA CAGATACTAC TAAAATTTTT 
TATTTAATTC AGCTACCGCA GAAAACTCTA TGTTTTTAAC 
CCATACTTAC AAGTAGGTTT TATGCCAGAA GCTCGCTCCA 
AAGTGGTCCA GGAGCTTATG CGTCAGCTTA CTATGATTTC 
AAGAAAATAC TATAGAAAAA ACTTTAAAAG CATCAGATTT 
AAATTCCCAG AAAATAATCT ATCTCAATTG ACAGAACAAG 
TCTATGGAGC TTTGATCAAG CAAGTGCAAA ATATCAATTT 
TAAGAGATTA TACTGGTGGA TCTCTTTCTG AAGACAATGG 
AATAAAAATA CTGCCCTCGA CAAAAACTAT TTATTAAATA 
ATCAAACAAT GTAGAAGAAG CTGGAAGTAA AAATTATGTT 
TACAGTTACA AGGAGATGAT ATAAGTTATG AAGCAACATG 
TCTAAAAATC CTAAAAATAG TATTATTATA CAACGAAATA 
TGCAAAAAGC TACTTTTTAA GTGATGATGG AGAATCTATT 
ATAAATATAG GATACCTGAA AGATTAAAAA ATAAGGAAAA 
ACCTTTATTG GACATGGTAA AGATGAATTC AACACAAGCG 
ATTAAGTGTA GATTCACTTT CCAATGAGAT AAGTTCATTT 
TAAAATTAGA TATATCACCT AAAAATGTAG AAGTAAACTT 
AATATGTTTA GTTATGATTT TAATGTTGAA GAAACTTATC 
GCTATTAAGT ATTATGGACA AAATTACTTC CACTTTACCT 
AAAATTCTAT TACTATAGGA GCAAATCAAT ATGAAGTAAG 
GAGGGAAGAA AAGAACTTCT GGCTCACTCA GGTAAATGGA 
AGAAGCTATT ATGAGCGATT TATCTAGTAA AGAATACATT 
CTATAGATAA TAAGCTAAAA GCAAAGTCCA AGAATATTCC 
TCAATATCAG AAGATATAAA AACATTATTA CTTGATGCAA 
TGATACAAAA TTTATTTTAA ATAATCTTAA GCTTAATATT 
TTGGGGATTA CATTTATTAT GAAAAATTAG AGCCTGTTAA 
CACAATTCTA TAGATGATTT AATAGATGAG TTCAATCTAC 
ATCTGATGAA TTATATGAAT TAAAAAAATT AAATAATCTA 
ATTTAATATC TTTTGAAGAT ATCTCAAAAA ATAATTCAAC 
AGATTTATTA ACAAAAGTAA TGGTGAGTCA GTTTATGTAG 
AGAAATTTTT T C AAAAT AT A GCGAACATAT TACAAAAGAA 
TAAAGAATAG TATAATTACA GATGTTAATG GTAATTTATT 
CAGTTAGATC ATACTTCTCA AGTTAATACA TTAAACGCAG 
TCAATCATTA ATAGATTATA GTAGCAATAA AGATGTACTG 
GTACCTCAGT TAAGGTTCAA CTTTATGCTC AACTATTTAG 
AATACTATAT ATGACTCTAT CCAATTAGTA AATTTAATAT 
AAATGATACT ATAAATGTAC TACCTACAAT AACAGAGGGG 
TATCTACTAT ATTAGACGGA ATAAACTTAG GTGCAGCAAT 
CTAGACGAAC ATGACCCATT ACTAAAAAAA GAATTAGAAG 
TGTTTTAGCA ATAAATATGT CATTATCTAT AGCTGCAACT 
TTGTTGGAAT AGGTGCTGAA GTTACTATTT TCTTATTACC 
ATATCTGCAG GAATACCTTC ATTAGTTAAT AATGAATTAA 
TAAGGCAACT TCAGTGGTAA ACTATTTTAA TCATTTGTCT 
AATATGGCCC TCTTAAAACA GAAGATGATA AAATTTTAGT 
GATTTAGTAA TATCAGAAAT AGATTTTAAT AATAATTCGA 
AACATGTAAT ATATTAGCAA TGGAGGGGGG ATCAGGACAC 
GTAATATAGA TCACTTTTTC TCATCTCCAT CTATAAGTTC 
TCATTATCAA TTTATTCTGC AATAGGTATA GAAACAGAAA 
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TTCAAAAAAA ATAATGATGT TACCTAATGC TCCTTCAAGA GTGTTTTGGT ^700 

GGGAAACTGG AGCAGTTCCA GGTTTAAGAT CATTGGAAAA TGACGGAACT 3750 

AGATTACTTG ATTCAATAAG AGATTTATAC CCAGGTAAAT TTTACTGGAG 3800 

ATTCTATGCT TTTTTCGATT ATGCAATAAC TACATTAAAA CCAGTTTATG 3850 

AAGACACTAA TATTAAAATT AAACTAGATA AAGATACTAG AAACTTCATA 3 900 

ATGCCAACTA TAACTACTAA CGAAATTAGA AACAAATTAT CTTATTCATT 3950 

TGATGGAGCA GGAGGAACTT ACTCTTTATT ATTATCTTCA TATCCAATAT 4 000 

CAACGAATAT AAATTTATCT AAAGATGATT TATGGATATT TAATATTGAT 4050 

AATGAAGTAA GAGAAATATC TATAGAAAAT GGTACTATTA AAAAAGGAAA 4100 

GTTAATAAAA GATGTTTTAA GTAAAATTGA TATAAATAAA AATAAACTTA 4150 

TTATAGGCAA T C AAACAAT A GATTTTTCAG GCGATATAGA TAATAAAGAT 4200 

AGATATATAT TCTTGACTTG TGAGTTAGAT GATAAAATTA GTTTAATAAT 4250 

AGAAATAAAT CTTGTTGCAA AATCTTATAG TTTGTTATTG TCTGGGGATA 4300 

AAAATTATTT GATATCCAAT TTATCTAATA CTATTGAGAA AATCAATACT 4350 

TTAGGCCTAG ATAGTAAAAA TATAGCGTAC AATTACACTG ATGAATCTAA 4400 

TAATAAATAT TTTGGAGCTA TATCTAAAAC AAGTCAAAAA AGCATAATAC 4450 

ATTATAAAAA AGACAGTAAA AATATATTAG AATTTTATAA TGACAGTACA 4500 

TTAGAATTTA ACAGTAAAGA TTTTATTGCT GAAGATATAA ATGTATTTAT 4550 

GAAAGATGAT ATTAATACTA TAACAGGAAA ATACTATGTT GATAATAATA 5000 

CTGATAAAAG TATAGATTTC TCTATTTCTT TAGTTAGTAA AAATCAAGTA 5050 

AAAGTAAATG GATTATATTT AAATGAATCC GTATACTCAT CTTACCTTGA 5100 

TTTTGTGAAA AATTCAGATG GACACCATAA TACTTCTAAT TTTATGAATT 5150 

TATTTTTGGA CAATATAAGT TTCTGGAAAT TGTTTGGGTT TGAAAATATA 5200 

AATTTTGTAA TCGATAAATA CTTTACCCTT GTTGGTAAAA CTAATCTTGG 5250 

ATATGTAGAA TTTATTTGTG ACAATAATAA AAATATAGAT ATATATTTTG 53 00 

GTGAATGGAA AACATCGTCA TCTAAAAGCA CTATATTTAG CGGAAATGGT 5350 

AGAAATGTTG TAGTAGAGCC TATATATAAT CCTGATACGG GTGAAGATAT 54 00 

ATCTACTTCA CTAGATTTTT CCTATGAACC TCTCTATGGA ATAGATAGAT 5450 

ATATAAATAA AGTATTGATA GCACCTGATT TATATACAAG TTTAATAAAT 55Q0 

ATTAATACCA ATTATTATTC AAATGAGTAC TACCCTGAGA TTATAGTTCT 5550 

. TAACCCAAAT ACATTCCACA AAAAAGTAAA TATAAATTTA GATAGTTCTT 5600 

CTTTTGAGTA TAAATGGTCT ACAGAAGGAA GTGACTTTAT TTTAGTTAGA 5650 

TACTTAGAAG AAAGTAATAA AAAAATATTA CAAAAAATAA GAATCAAAGG 5700 

TATCTTATCT AATACTCAAT CATTTAATAA AATGAGTATA GATTTTAAAG 5750 

ATATTAAAAA ACTATCATTA GGATATATAA TGAGTAATTT TAAATCATTT 5800 

AATT CTGAAA ATGAATTAGA TAGAGATCAT TTAGGATTTA AAATAATAGA 5850 

TAATAAAACT TATTACTATG ATGAAGATAG TAAATTAGTT AAAGGATTAA 5900 

GCAAGTAATT GACAATAATA AGTATTATTT CAATCCTGAC ACTGCTATCA 5950 

TCTCAAAAGG TTGGCAGACT GTTAATGGTA GTAGATACTA CTTTGATACT 6000 

GATACCGCTA TTGCCTTTAA TGGTTATAAA ACTATTGATG GTAAACACTT 6050 

TTATTTTGAT AGTGATTGTG TAGTGAAAAT AGGTGTGTTT AGTACCTCTA 6100 

ATGGATTTGA ATATTTTGCA CCTGCTAATA CTTATAATAA TAACATAGAA 6150 

GGTCAGGCTA TAGTTTATCA AAGTAAATTC TTAACTTTGA ATGGTAAAAA 62 00 

ATATTACTTT GATAATAACT CAAAAGCAGT TACCGGATTG CAAACTATTG 6250 

ATAGTAAAAA ATATTACTTT AATACTAACA CTGCTGAAGC AGCTACTGGA 6300 

TGGCAAACTA TTGATGGTAA AAAATATTAC TTTAATACTA ACACTGCTGA 6350 

AGCAGCTACT GGATGGCAAA CTATTGATGG TAAAAAATAT TACTTTAATA 64 00 

CTAACACTGC TATAGCTTCA ACTGGTTATA CAATTATTAA TGGTAAACAT 6450 

TTTTATTTTA ATACTGATGG TATTATGCAG ATAGGAGTGT TTAAAGGACC 6500 

TAATGGATTT GAATATTTTG CACCTGCTAA TACGGATGCT AACAACATAG 6550 

AAGGTCAAGC TATACTTTAC CAAAATGAAT TCTTAACTTT GAATGGTAAA 6600 

AAATATTACT TTGGTAGTGA CTCAAAAGCA GTTACTGGAT GGAGAATTAT 6650 

TAACAATAAG AAATATTACT TTAATCCTAA TAATGCTATT GCTGCAATTC 6700 

ATCTATGCAC TATAAATAAT GACAAGTATT ACTTTAGTTA TGATGGAATT 6750 

CTTCAAAATG GATATATTAC TATTGAAAGA AATAATTTCT ATTTTGATGC 6800 

TAATAATGAA TCTAAAATGG TAACAGGAGT ATTTAAAGGA CCTAATGGAT 6850 

TTGAGTATTT TGCACCTGCT AATACTCACA ATAATAACAT AGAAGGTCAG 6900 
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GCTATAGTTT ACCAGAACAA ATTCTTAACT TTGAATGGCA AAAAATATTA 6^950 

TTTTGATAAT GACTCAAAAG CAGTTACTGG ATGGCAAACC ATTGATGGTA 7000 

AAAAATATTA CTTTAATCTT AACACTGCTG AAGCAGCTAC TGGATGGCAA 7050 

ACTATTGATG GTAAAAAATA TTACTTTAAT CTTAACACTG CTGAAGCAGC 7100 

TACTGGATGG CAAACTATTG ATGGTAAAAA ATATTACTTT AATACTAACA 7150 

CTTTCATAGC CTCAACTGGT TATACAAGTA TTAATGGTAA ACATTTTTAT 7200 

TTTAATACTG ATGGTATTAT GCAGATAGGA GTGTTTAAAG GACCTAATGG 7250 

ATTTGAATAC TTTGCACCTG CTAATACGGA TGCTAACAAC ATAGAAGGTC 7300 

AAGCTATACT TTACCAAAAT AAATTCTTAA CTTTGAATGG TAAAAAATAT 7350 

TACTTTGGTA GTGACTCAAA AGCAGTTACC GGACTGCGAA CTATTGATGG 7400 

TAAAAAATAT TACTTTAATA CTAACACTGC TGTTGCAGTT ACTGGATGGC 74 50 

AAACTATTAA TGGTAAAAAA TACTACTTTA ATACTAACAC TTCTATAGCT 7500 

TCAACTGGTT ATACAATTAT TAGTGGTAAA CATTTTTATT TTAATACTGA 7550 

TGGTATTATG CAGATAGGAG TGTTTAAAGG ACCTGATGGA TTTGAATACT 7600 

TTGCACCTGC TAATACAGAT GCTAACAATA TAGAAGGTCA AGCTATACGT 7650 

TATCAAAATA GATTCCTATA TTTACATGAC AATATATATT ATTTTGGTAA 7700 

TAATTCAAAA GCGGCTACTG GTTGGGTAAC TATTGATGGT AATAGATATT 7750 

ACTTCGAGCC TAATACAGCT ATGGGTGCGA ATGGTTATAA AACTATTGAT 7800 

AATAAAAATT TTTACTTTAG AAATGGTTTA CCTCAGATAG GAGTGTTTAA 7850 

AGGGTCTAAT GGATTTGAAT 'ACTTTGCACC TGCTAATACG GATGCTAACA 7900 

ATATAGAAGG TCAAGCTATA CGTTATCAAA ATAGATTCCT ACATTTACTT 7950 

GGAAAAATAT ATTACTTTGG TAATAATTCA AAAGCAGTTA CTGGATGGCA 8000 

AACTATTAAT GGTAAAGTAT ATTACTTTAT GCCTGATACT GCTATGGCTG 8050 

CAGCTGGTGG ACTTTTCGAG ATTGATGGTG TTATATATTT .CTTTGGTGTT 8100 

GATGGAGTAA AAGCCCCTGG GATATATGGC TAA 8133 



2) INFORMATION FOR SEQ, ID NO: 2130 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: VPI 10463 

(C) ACCESSION NUMBER: X53138 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2130 



ATGAGTTTAG TTAATAGAAA ACAGTTAGAA AAAATGGCAA ATGTAAGATT 50 

TCGTACTCAA GAAGATGAAT ATGTTGCAAT ATTGGATGCT TTAGAAGAAT 100 

ATCATAATAT GTCAGAGAAT ACTGTAGTCG AAAAATATTT AAAATTAAAA 150 

GATATAAATA GTTTAACAGA TATTTATATA GATACATATA AAAAATCTGG 200 

TAGAAATAAA GCCTTAAAAA AATTTAAGGA ATATCTAGTT ACAGAAGTAT 250 

TAGAGCTAAA GAATAATAAT TTAACTCCAG TTGAGAAAAA TTTACATTTT 300 

GTTTGGATTG GAGGTCAAAT AAATGACACT GCTATTAATT ATATAAATCA 350 

ATGGAAAGAT GTAAATAGTG ATTATAATGT TAATGTTTTT TATGATAGTA 400 

ATGCATTTTT GATAAACACA TTGAAAAAAA CTGTAGTAGA ATCAGCAATA 450 

AATGATACAC TTGAATCATT TAGAGAAAAC TTAAATGACC CTAGATTTGA 500 

CTATAATAAA TTCTTCAGAA AACGTATGGA AATAATTTAT GATAAACAGA 550 

AAAATTT CAT AAACTACTAT AAAGCTCAAA GAGAAGAAAA TCCTGAACTT 600 
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ATAATTGATG ATATTGTAAA GACATATCTT TCAAATGAGT ATTCAAAGGA 650 

GATAGATGAA CTTAATACCT ATATTGAAGA ATCCTTAAAT AAAATTACAC 700 

AGAATAGTGG AAATGATGTT AGAAACTTTG AAGAATTTAA AAATGGAGAG 750 

TCATTCAACT TATATGAACA AGAGTTGGTA GAAAGGTGGA ATTTAGCTGC 800 

TGCTTCTGAC ATATTAAGAA TATCTGCATT AAAAGAAATT GGTGGTATGT 850 

ATTTAGATGT TGATATGTTA CCAGGAATAC AACCAGACTT ATTTGAGTCT 900 

ATAGAGAAAC CTAGTTCAGT AACAGTGGAT TTTTGGGAAA TGACAAAGTT 950 

AGAAGCTATA ATGAAATACA AAGAATATAT ACCAGAATAT ACCTCAGAAC 1000 

ATTTTGACAT GTTAGACGAA GAAGTTCAAA GTAGTTTTGA ATCTGTTCTA 1050 

GCTTCTAAGT CAGATAAATC AGAAATATTC TCATCACTTG GTGATATGGA 1100 

GGCATCACCA CTAGAAGTTA AAATTGCATT TAATAGTAAG GGTATTATAA 1150 

ATCAAGGGCT AATTTCTGTG AAAGACTCAT ATTGTAGCAA TTTAATAGTA 1200 

AAACAAATCG AGAATAGATA TAAAATATTG AATAATAGTT TAAATCCAGC 1250 

TATTAGCGAG GATAATGATT TTAATACTAC AACGAATACC TTTATTGATA 1300 

GTATAATGGC TGAAGCTAAT GCAGATAATG GTAGATTTAT GATGGAACTA 13 50 

GGAAAGTATT TAAGAGTTGG TTTCTTCCCA GATGTTAAAA CTACTATTAA 1400 

CTTAAGTGGC CCTGAAGCAT ATGCGGCAGC TTATCAAGAT TTATTAATGT 1450 

TTAAAGAAGG CAGTATGAAT ATCCATTTGA TAGAAGCTGA TTTAAGAAAC 1500 

TTTGAAATCT CTAAAACTAA TATTTCTCAA TCAACTGAAC AAGAAATGGC 1550 

TAGCTTATGG TCATTTGACG ATGCAAGAGC TAAAGCTCAA TTTGAAGAAT 16 00 

ATAAAAGGAA TTATTTTGAA GGTTCTCTTG GTGAAGATGA TAATCTTGAT 1650 

TTTTCTCAAA ATATAGTAGT TGACAAGGAG TATCTTTTAG AAAAAATATC 17 00 

TTCATTAGCA AGAAGTTCAG AGAGAGGATA TATACACTAT ATTGTTCAGT 1750 

TACAAGGAGA TAAAATTAGT TATGAAGCAG CATGTAACTT ATTTGCAAAG 18 00 

ACTCCTTATG ATAGTGTACT GTTTCAGAAA AATATAGAAG ATTCAGAAAT 1850 

TGCATATTAT TATAATCCTG GAGATGGTGA AATACAAGAA ATAGACAAGT 1900 

ATAAAATTCC AAGTATAATT TCTGATAGAC CTAAGATTAA ATTAACATTT 1950 

ATTGGTCATG GTAAAGATGA ATTTAATACT GATATATTTG CAGGTTTTGA 2000 

TGTAGATTCA TTATCCACAG AAATAGAAGC AGCAATAGAT TTAGCTAAAG 2050 

AGGATATTTC TCCTAAGTCA ATAGAAATAA ATTTATTAGG ATGTAATATG 2100 

TTTAGCTACT CTATCAACGT AGAGGAGACT TATCCTGGAA AATTATTACT 2150 

TAAAGTTAAA GATAAAATAT CAGAATTAAT GCCATCTATA AGTCAAGACT 22 00 

CTATTATAGT AAGTGCAAAT CAATATGAAG TTAGAATAAA TAGTGAAGGA 2250 

AGAAGAGAAT TATTGGATCA TTCTGGTGAA TGGATAAATA AAGAAGAAAG 23 00 

TATTATAAAG GATATTTCAT CAAAAGAATA TATATCATTT AATCCTAAAG 23 50 

AAAATAAAAT TACAGTAAAA TCTAAAAATT TACCTGAGCT ATCTACATTA 2400 

TTACAAGAAA TTAGAAATAA TTCTAATTCA AGTGATATTG AACTAGAAGA 24 50 

AAAAGTAATG TTAACAGAAT GTGAGATAAA TGTTATTTCA AATATAGATA 2500 

CGCAAATTGT TGAGGAAAGG ATTGAAGAAG CTAAGAATTT AACTTCTGAC 2550 

TCTATTAATT ATATAAAAGA TGAATTTAAA CTAATAGAAT CTATTTCTGA 26 00 

TGCACTATGT GACTTAAAAC AACAGAATGA ATTAGAAGAT TCTCATTTTA 2650 

TATCTTTTGA GGACATATCA GAGACTGATG AGGGATTTAG TATAAGATTT 2700 

ATTAATAAAG AAACTGGAGA ATCTATATTT GTAGAAACTG AAAAAACAAT 2750 

ATTCTCTGAA TATGCTAATC ATATAACTGA AGAGATTTCT AAGATAAAAG 2800 

GTACTATATT TGATACTGTA AATGGTAAGT TAGTAAAAAA AGTAAATTTA 2850 

GATACTACAC ACGAAGTAAA TACTTTAAAT GCTGCATTTT TTATACAATC 2900 

ATTAATAGAA TATAATAGTT CTAAAGAATC TCTTAGTAAT TTAAGTGTAG 2950 

CAATGAAAGT CCAAGTTTAC GCTCAATTAT TTAGTACTGG TTTAAATACT 3 000 

ATTACAGATG CAGCCAAAGT TGTTGAATTA GTATCAACTG CATTAGATGA 3 050 

AACTATAGAC TTACTTCCTA CATTATCTGA AGGATTACCT ATAATTGCAA 3100 

CTATTATAGA TGGTGTAAGT TTAGGTGCAG CAATCAAAGA GCTAAGTGAA 3150 

ACGAGTGACC CATTATTAAG ACAAGAAATA GAAGCTAAGA TAGGTATAAT 32 0 0 

GGCAGTAAAT TTAACAACAG CTACAACTGC AATCATTACT TCATCTTTGG 3250 

GGATAGCTAG TGGATTTAGT ATACTTTTAG TTCCTTTAGC AGGAATTTCA 33 00 

GCAGGTATAC CAAGCTTAGT AAACAATGAA CTTGTACTTC GAGATAAGGC 33 50 

AACAAAGGTT GTAGATTATT TTAAACATGT TTCATTAGTT GAAACTGAAG 34 00 

GAGTATTTAC TTTATTAGAT GATAAAATAA TGATGCCAGA AGATGATTTA 3450 
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GTGATATCAG AAATAGATTT TAATAATAAT TCAATAGTTT TAGGTAAATG 3500 

TGAAATCTGG AGAATGGAAG GTGGTTCAGG TCATACTGTA ACTGATGATA 3550 

TAGATCACTT CTTTTCAGCA CCATCAATAA CATATAGAGA GCCACACTTA 3600 

TCTATATATG ACGTATTGGA AGTACAAAAA GAAGAACTTG ATTTGTCAAA 3650 

AGATTTAATG GTATTACCTA ATGCTCCAAA TAGAGTATTT GCTTGGGAAA 3700 

CAGGATGGAC ACCAGGTTTA AGAAGCTTAG AAAATGATGG CACAAAACTG 3750 

TTAGACCGTA TAAGAGATAA CTATGAAGGT GAGTTTTATT GGAGATATTT 3800 

TGCTTTTATA GCTGATGCTT TAATAACAAC ATTAAAACCA AGATATGAAG 3850 

ATACTAATAT AAGAATAAAT TTAGATAGTA ATACTAGAAG TTTTATAGTT 3900 

CCAATAATAA CTACAGAATA TATAAGAGAA AAATTATCAT ATTCTTTCTA 3 950 

TGGTTCAGGA GGAACTTATG CATTGTCTCT TTCTCAATAT AATATGGGTA 4000 

TAAATATAGA ATTAAGTGAA AGTGATGTTT GGATTATAGA TGTTGATAAT 4050 

GTTGTGAGAG ATGTAACTAT AGAATCTGAT AAAATTAAAA AAGGTGATTT 4100 

AATAGAAGGT ATTTTATCTA CACTAAGTAT TGAAGAGAAT AAAATTATCT 4150 

TAAATAGCCA TGAGATTAAT TTTTCTGGTG AGGTAAATGG AAGTAATGGA 4200 

TTTGTTTCTT TAACATTTTC AATTTTAGAA GGAATAAATG CAATTATAGA 4250 

AGTTGATTTA TTATCTAAAT CATATAAATT ACTTATTTCT GGCGAATTAA 4300 

AAATATTGAT GTTAAATTCA AATCATATTC AACAGAAAAT AGATTATATA 43 50 

GGATTCAATA GCGAATTACA GAAAAATATA CCATATAGCT TTGTAGATAG 4400 

TGAAGGAAAA GAGAATGGTT TTATTAATGG TTCAACAAAA GAAGGTTTAT 4450 

TTGTATCTGA ATTACCTGAT GTAGTTCTTA TAAGTAAGGT TTATATGGAT 4500 

GATAGTAAGC CTTCATTTGG ATATTATAGT AATAATTTGA AAGATGTCAA 4550 

AGTTATAACT AAAGATAATG TTAATATATT AACAGGTTAT TATCTTAAGG 4600 

ATGATATAAA AATCTCTCTT TCTTTGACTC TACAAGATGA AAAAACTATA 4650 

AAGTTAAATA GTGTGCATTT AGATGAAAGT GGAGTAGCTG AGATTTTGAA 4700 

GTTCATGAAT AGAAAAGGTA ATACAAATAC TTCAGATTCT TTAATGAGCT 4750 

TTTTAGAAAG TATGAATATA AAAAGTATTT TCGTTAATTT CTTACAATCT 4800 

AATATTAAGT TTATATTAGA TGCTAATTTT ATAATAAGTG GTACTACTTC 4850 

TATTGGCCAA TTTGAGTTTA TTTGTGATGA AAATGATAAT ATACAACCAT 4 900 

ATTTCATTAA GTTTAATACA CTAGAAACTA ATTATACTTT ATATGTAGGA 4 950 

AATAGACAAA ATATGATAGT GGAACCAAAT TATGATTTAG ATGATTCTGG 5000 

AGATATATCT TCAACTGTTA TCAATTTCTC TCAAAAGTAT CTTTATGGAA 5050 

TAGACAGTTG TGTTAATAAA GTTGTAATTT CACCAAATAT TTATACAGAT 5100 

GAAATAAATA TAACGCCTGT ATATGAAACA AATAATACTT ATCCAGAAGT 5150 

TATTGTATTA GATGCAAATT ATATAAATGA AAAAATAAAT GTTAATATCA 5200 

ATGATCTATC TATACGATAT GTATGGAGTA ATGATGGTAA TGATTTTATT 5250 

CTTATGTCAA CTAGTGAAGA AAATAAGGTG TCACAAGTTA AAATAAGATT 5300 

CGTTAATGTT TTTAAAGATA AGACTTTGGC AAATAAGCTA TCTTTTAACT 5350 

TTAGTGATAA ACAAGATGTA CCTGTAAGTG AAATAATCTT ATCATTTACA 5400 

CCTTCATATT ATGAGGATGG ATTGATTGGC TATGATTTGG GTCTAGTTTC 5450 

TTTATATAAT GAGAAATTTT ATATTAATAA CTTTGGAATG ATGGTATCTG 5500 

GATTAATATA TATTAATGAT TCATTATATT ATTTTAAACC ACCAGTAAAT 5550 

AATTTGATAA CTGGATTTGT GACTGTAGGC GATGATAAAT ACTACTTTAA 5600 

TCCAATTAAT GGTGGAGCTG CTTCAATTGG AGAGACAATA ATTGATGACA 5650 

AAAATTATTA TTTCAACCAA AGTGGAGTGT TACAAACAGG TGTATTTAGT 5700 

ACAGAAGATG GATTTAAATA TTTTGCCCCA GCTAATACAC TTGATGAAAA 5750 

CCTAGAAGGA GAAGCAATTG ATTTTACTGG AAAATTAATT ATTGACGAAA 5800 

ATATTTATTA TTTTGATGAT AATTATAGAG GAGCTGTAGA ATGGAAAGAA 5850 

TTAGATGGTG AAATGCACTA TTTTAGCCCA GAAACAGGTA AAGCTTTTAA 5900 

AGGTCTAAAT CAAATAGGTG ATTATAAATA CTATTTCAAT TCTGATGGAG 5950 

TTATGCAAAA AGGATTTGTT AGTATAAATG ATAATAAACA CTATTTTGAT 6000 

GATTCTGGTG TTATGAAAGT AGGTTACACT GAAATAGATG GCAAGCATTT 6050 

CTACTTTGCT GAAAACGGAG AAATGCAAAT AGGAGTATTT AATACAGAAG 6100 

ATGGATTTAA ATATTTTGCT CATCATAATG AAGATTTAGG AAATGAAGAA 6150 

GGTGAAGAAA TCTCATATTC TGGTATATTA AATTTCAATA ATAAAATTTA 6200 

CTATTTTGAT GATTCATTTA CAGCTGTAGT TGGATGGAAA GATTTAGAGG 6250 

ATGGTTCAAA GTATTATTTT GATGAAGATA CAGCAGAAGC ATATATAGGT 6300 
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TTGTCATTAA TAAATGATGG TCAATATTAT 
GCAAGTTGGA TTTGTCACTA TAAATGATAA 
CTGGAATTAT AGAATCTGGA GTACAAAACA 
ATAGATGATA ATGGTATAGT TCAAATTGGT 
ATATAAATAT TTTGCACCTG CTAATACTGT 
AAGCAGTTGA ATATAGTGGT TTAGTTAGAG 
TTTGGAGAAA CATATACAAT TGAGACTGGA 
TGAAAGTGAT AAATATTATT TCAATCCAGA 
GTATTAATTT AATTGATGAT ATAAAATATT 
ATGAGAACGG GTCTTATATC ATTTGAAAAT 
GAATGGTGAA ATGCAATTTG GTTATATAAA 
ATTTTGGTGA AGATGGTGTC ATGCAGATTG 
GGATTTAAAT ACTTTGCACA TCAAAATACT 
AGAATCAATA AACTATACTG GTTGGTTAGA 
ATTTTACAGA TGAATATATT GCAGCAACTG 
GAGGAGTATT ATTTTGATCC TGATACAGCT 
G 



PCT/CA00/01150 

TTTAATGATG ATGGAATTAT 6350 

AGTCTTCTAC TTCTCTGACT 6400 

TAGATGACAA TTATTTCTAT 6450 

GTATTTGATA CTTCAGATGG 6500 

AAATGATAAT ATTTACGGAC 6550 

TTGGGGAAGA TGTATATTAT 6600 

TGGATATATG ATATGGAAAA 6650 

AACTAAAAAA GCATGCAAAG 6700 

ATTTTGATGA GAAGGGCATA 6750 

AATAATTATT ACTTTAATGA 6800 

TATAGAAGAT AAGATGTTCT 6850 

GAGTATTTAA TACACCAGAT 6900 

TTGGATGAGA ATTTTGAGGG 6950 

TTTAGATGAA AAGAGATATT 7000 

GTTCAGTTAT TATTGATGGT 7050 

CAATTAGTGA TTAGTGAATA 7100 

7101 



2) INFORMATION FOR SEQ ID NO: 2131 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION : : SEQ ID NO: 2131 
AAGCMATTGT TGTAAATTTT GAAAG 



2) INFORMATION FOR SEQ ID NO: 2132 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2132 
TCATATCCAT AGCAATAGTT CTA 



2) INFORMATION FOR SEQ ID NO: 2133 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2133 
CCACGCACAW AAACTTGTTT TAGAAGTAGC AGCWCAGCGT GG 



2) INFORMATION FOR SEQ ID NO: 2134 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2134 
CGCGTGAAGC TTCTGTG 



2) INFORMATION FOR SEQ ID NO: 2135 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2135 
TCTCGCCGTT ATTCAGTTTC 



2) INFORMATION FOR SEQ ID NO: 2136 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2136 
CCAACGCGTC CTCAATCATT TCTAACTTCT ATGGCCGGCG TTGG 
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2) INFORMATION FOR SEQ ID NO: 2137 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 889 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas putida 

(C) ACCESSION NUMBER: Genome project 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2137 

ATGATCACCG GTGCTGCCCA GATGGACGGC GCGATCCTGG TTTGCTCGGC 50 

CGCCGATGGT CCGATGCCrC AAACCCGTGA GCACATCCTG CTGTCCCGTC 100 

AGGTAGGCGT TCCGTACATC GTGGTCTTCC TGAACAAGGC CGACCTGGTA 150 

GACGACGCTG AGCTGCTGGA ACTGGTCGAG ATGGAAGTTC GCGACCTGCT 200 

GTCCACCTAC GACTTCCCAG GCGACGACAC TCCGATCATC ATCGGTTCCG 250 

CTCGTATGGC CCTGGAAGGC AAAGACGACA . ACGAAATGGG CACTACCGCT 300 

GTCAAGAAGC TGGTAGAAAC TCTGGATGCC TACATCCCTG AGCCAGTTCG 350 

TGCCATCGAC CAGCCGTTCC TGATGCCGAT CGAAGACGTG TTCTCGATCT 400 

CGGGTCGTGG TACCGTTGTT ACCGGTCGTA TCGAGCGTGG CATCGTTCGC 450 

GTTCAGGATC CGCTGGAAAT CGTTGGTCTG CGTGACACCA CCACCACCAC 500 

CTGCACCGGT GTTGAGATGT TCCGCAAGCT GCTGGACGAA GGTCGTGCTG 550 

GCGAGAACTG CGGCGTTCTG CTGCGTGGTA CCAAGCGTGA CGACGTTGAG 600 

CGTGGCCAGG TTCTGGTCAA GCCAGGTTCG GTCAAGCCGC ACACCAAGTT 650 

CACCGCAGAA GTCTACGTCC TGTCGAAGGA AGAAGGCGGC CGTCACACTC 700 

CGTTCTTCAA AGGCTACCGT CCACAGTTCT ACTTCCGTAC CACTGACGTG 750 

ACCGGTAACT GCGAACTGCC GGAAGGCGTT GAAATGGTAA TGCCAGGTGA 800 

CAACATTCAG ATGACTGTTA CCCTGATCAA GACCATCGCA ATGGAAGACG 850 

GTCTGCGCTT CGCTATCCGT GAAGGCGGTC GTACCGTCG 889 



2) INFORMATION FOR SEQ ID NO: 2138 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 900 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Pseudomonas aeruginosa 

(B) STRAIN: PAO-1 

(C) ACCESSION NUMBER: Genome project 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2138 
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AACATGATCA CCGGTGCTGC 
GGCTGCCGAC GGCCCCATGC 
GCCAGGTAGG CGTTCCCTAC 
GTCGACGACG CCGAGCTGCT 
GCTGAACACC TACGACTTCC 
CCGCGCTGAT GGCGCTGGAA 
GCCGTGCAGA AGCTGGTAGA 
TCGTGCCATC GACCAGCCGT 
TCTCCGGCCG CGGTACCGTG 
AAGGTCCAGG AAGAAGTGGA 
TACCTGCACC GGCGTTGAAA 
CTGGTGAGAA CGTTGGTATC 
GAGCGTGGCC AGGTTCTGGC 
GTTCGAGTGC GAAGTGTACG 
CCCCGTTCTT CAAGGGCTAC 
GTGACCGGTA ACTGCGAACT 
CGACAACATC AAGATGGTTG 
ATGGCCTGCG CTTCGCGATC 



CCAGATGGAC 
CGCAGACCCG 
ATCGTCGTGT 
GGAACTGGTC 
CGGGCGACGA 
GGCAAGGATG 
GACCCTGGAC 
TCCTGATGCC 
GTAACCGGTC 
AATCGTCGGC 
TGTTCCGCAA 
CTGCTGCGTG 
CAAGCCGGGC 
TGCTGTCCAA 
CGTCCGCAGT 
GCCGGAAGGC 
TCACCCTGAT 
CGCGAAGGCG 



GGCGCGATCC 
CGAGCACATC 
TCCTGAACAA 
GAGATGGAAG 
CACTCCGATC 
ACAACGGCAT 
TCCTACATTC 
GATCGAAGAC 
GTGTAGAGCG 
ATCAAGGCGA 
GCTGCTCGAC 
GCACCAAGCG 
ACCATCAAGC 
GGAAGAAGGT 
TCTACTTCCG 
GTAGAGATGG 
CGCTCCGATC 
GCCGTACCGT 



TGGTTTGCTC 
CTGCTGTCCC 
GGCCGACATG 
TTCGCGATCT 
ATCATCGGTT 
CGGCGTAAGC 
CGGAGCCGGT 
GTGTTCTCGA 
CGGCATCATC 
CCACCAAGAC 
GAAGGTCGTG 
TGAAGACGTA 
CGCACACCAA 
GGTCGTCACA 
TACCACCGAC 
TAATGCCGGG 
GCCATGGAAG 
TGGCGCCGGC 



50 
100 
150 
200 
250 
300 
350 
400 
450 
500 
550 
600 
650 
700 
750 
800 
850 
900 



2) INFORMATION FOR SEQ ID NO: 2139 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1398 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni 

(B) STRAIN: NCTC 11168 

(C) ACCESSION NUMBER: AL139074 

(xi) ? SEQUENCE DESCRIPTION: *SEQ ID NO: 2139 

ATGCAAGGAT TTATTTCACA GGTATTAGGT CCGGTTGTTG ATGTAGATTT 50 

TAACGACTAT TTGCCTCAAA TTAATGAAGC AATTGTTGTA AATTTTGAAA 100 

GCGAAGGAAA AAAACATAAA CTTGTTTTAG AAGTAGCAGC TCATTTAGGA 150 

GATAATAGAG TTAGAACTAT TGCTATGGAT ATGACAGATG GTTTGGTAAG 200 

GGGCTTAAAA GCTGAGGCTT TAGGTGCTCC CATTAGTGTT CCTGTTGGTG 250 

AGAAAGTTTT AGGAAGAATT TTCAATGTTA CTGGAGATTT GATCGATGAA 300 

GGTGAAGAAA TTTCTTTCGA TAAAAAATGG GCAATTCATA GAGATCCACC 350 

AGCTTTTGAA GATCAAAGCA CAAAAAGTGA GATTTTTGAA ACAGGGATTA 400 

AAGTTGTAGA TTTACTTGCT CCTTATGCAA AAGGTGGTAA AGTAGGTCTT 450 

TTTGGTGGTG CAGGTGTTGG TAAAACTGTT ATTATTATGG AGCTTATTCA 500 

CAATGTTGCA TTTAAGCATA GCGGCTATTC TGTATTTGCA GGTGTGGGTG 550 

AGAGAACTCG TGAAGGAAAT GACCTTTATA ATGAAATGAA AGAAAGTAAT 600 

GTTTTAGACA AAGTTGCTCT ATGTTATGGA CAAATGAATG AACCACCAGG 650 

GGCAAGAAAT CGTATTGCTT TAACAGGTTT AACAATGGCT GAGTATTTTA 700 

GAGATGAAAT GGGTCTTGAT GTGCTTATGT TTATTGATAA TATCTTTAGA 750 

TTTTCACAAT CAGGTTCTGA AATGTCAGCA CTTTTAGGAA GAATTC C ATC 800 

AGCTGTGGGT TATCAACCAA CCTTAGCAAG TGAAATGGGT AAATTCCAAG 850 

AAAGAATTAC TTCAACTAAA AAAGGCTCAA TCACTTCAGT TCAAGCTGTT 900 

TACGTTCCAG CTGATGACTT AACAGACCCA GCTCCAGCAA CTGTTTTTGC 950 
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TCACTTAGAT GCTACAACGG TTTTAAATAG 
TTTATCCTGC AGTTGACCCA CTTGACTCAA 
AATATCATTG GAGAAGAACA TTATAAAGTT 
GCTTCAAAAA TACAAAGATT TACAAGATAT 
ATGAGCTTAG CGAAGAGGAT AAACTTGTAG 
GAAAAATTCT TATCACAACC ATTTTTCGTT 
CCCAGGAAAA TATATAAGCC TTGAAGATAC 
TTTTAGAAGG TAAATATGAT CATTTACCAG 
GGAAATATAG ATGAAGCTAT TGCAAAAGCG 



PCT/CA00/01150 

AGCTATTGCT GAAAAGGGTA 1000 

CTTCAAGAAT GCTTGACCCA 1050 

GCTCGTGGTG TTCAATCAGT 1100 

CATCGCCATT TTAGGTATGG 1150 

TTGAAAGAGC AAGAAAGATT 1200 

GCAGAAGTTT TCACAGGTAG 1250 

AATAGCAGGA TTTAAGGGAA 1300 

AAAATGCTTT CTATATGGTT 1350 

GATAAATTAA AAGGTTAA 1398 



2) INFORMATION FOR SEQ ID NO: 2140 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2157 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus pneumoniae 

(B) STRAIN: 670 

(C) ACCESSION NUMBER: X67867 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2140 

ATGAACAAAC CAACGATTCT GCGCCTAATC AAGTATCTGA GCATTAGCTT 50 

CTTAAGCTTG GTTATCGCAG CCATTGTCTT AGGCGGAGGA GTTTTTTTCT 100 

ACTACGTTAG CAAGGCTCCT AGCCTATCCG AGAGTAAACT AGTTGCAACA 150 

ACTT CTAGT A AAATCTACGA CAATAAAAAT CAACTCATTG CTGACTTGGG 200 

TTCTGAACGC CGCGTCAATG CCCAAGCTAA TGATATTCCC ACAGATTTGG 250 

TTAAGGCAAT CGTTTCTATC GAAGACCATC GCTTCTTCGA CCACAGGGGG 300 

ATTGATTCCA TCCGTATCCT GGGAGCTTTC TTGCGCAATC TGCAAAGTAA 350 

TTCCCTCCAA GGTGGATCAA CTCTCACCCA ACAGTTGATT AAGTTGACTT 4 00 

ACTTTTCAAC CTCGACTTCC GACCAGACTA TTTCTCGTAA GGCTCAGGAA 450 

GCTTGGTTAG CGATTCAGTT AGAACAAAAA GCAACCAAAC AGGAAAT CTT 500 

GACCTACTAT ATAAATAAGG TCTACATGTC TAATGGCAAC TATGGAATGC 550 

AGACAGCAGC TCAAAACTAC TATGGTAAAG ACCTCAATAA TTTAAGTTTA 600 

CCTCAGTTAG CCTTGCTGGC TGGAATGCCT CAGGCACCAA ACCAATATGA 6 50 

CCCCTATTCA CATCCAGAAG CAGCCCAAGA CCGCCGAAAC TTGGTCTTAT 700 

CTGAAATGAA AAATCAAGGT TACATCTCTG CTGAACAGTA TGAGAAAGCA 750 

GTCAATACAC CAATTACTGA TGGACTACAA AGTCTCAAAT CAGCAAGTAA 800 

TTACCCTGCT TACATGGATA ATTACCTCAA GGAGGTCATC AATCAAGTAG 850 

AACAAGAAAC TGGCTATAAC CTTCTAACTA CTGGGATGGA TGTTTACACA 900 

AATGTAGACC AAGAAGCTCA AAAACATCTG TGGGATATCT ACAACTCCGA 950 

TCAATACGTC TCTTACCCTG ACGATGATTT GCAAGTCGCA TCTACGGTCG 1000 

TAGATGTTTC AAATGGTAAA GTCATCGCCC AACTTGGAGC TCGTCACCAA 1050 

GCAAGTAACG TTTCATTTGG TACCAACCAA GCTGTGGAAA CCAATCGTGA 1100 

CTGGGGTTCT GCTATGAAAC CAATCACCGA TTATGCACCT GCCATAGAAT 1150 

ACGGTGTTTA TGATTCCACT GCAACTATGG TTAATGATAT TCCTTATAAC 1200 

TATCCGGGAA CAAGCACACC TGTCTACAAC TGGGATAGAG CATATTTCGG 1250 

TAATATTACT CTGCAATATG CTCTTCAACA ATCACGAAAT GTCACAGCCG 1300 

TTGAGACTTT GAATAAGGTC GGTCTAGATA GAGCTAAAAC CTTCCTTAAT 1350 

GGTCTTGGTA TCGACTATCC AAGCATGCAT TATGCAAACG CCATTTCAAG 1400 
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TAATACAACA GAATCTAATA AACAATACGG 
CTGCTGCTTA TGCTGCCTTT GCCAACGGTG 
TATATCAATA AGGTCGTCTT CAGTGACGGT 
TGTAGGTACA CGAGCTATGA AAGAAACAAC 
TGATGAAAAC TGTCTTGGCA TACGGAACTG 
TGGTTAGCGC AAGCTGGTAA GACAGGTACT 
AATTGAAAAA CACATCAAGA ACACTGGCTA 
TTGTTGGTTA TACTCGTAAG TATTCTATGG 
AATCGTTTAA CTCCTATCGT TGGAGATGGT 
TTATCGCTCA ATGATAACGT ATCTATCAGA 
GGACGATGCC AGACGGACTT TTCAGAAACG 
GGAGCTCGTT CTACGTGGAG CTCACCTGCT 
TGAAAGTTCA AGCTCATCAT CAGATAGTTC 
CCACTCCAAG CACAAATAAT AGTACGACTA 
CAACAATCAA ATACAACCCC TGATCAACAA 
ACAACCA 



PCT/CA00/01150 

AGCAAGTAGT GAAAAAATGG f450 

GAATTTACCA CAAACCCATG 1500 

AGTAAAAAAG AATTTTCAGA 1550 

TGCTTACATG ATGACCGAAA 1600 

GTCGTGGAGC CTATCTCCCA 1650 

TCTAACTACA CAGATGATGA 1700 

TGTAGCTCCA GATGAAATGT 1750 

CTGTATGGAC AGGTTATTCG 1800 

TTCCTAGTTG CAGCTAAAGT 1850 

AGATACTCAT CCAGAAGACT 1900 

GGGAATTTGT ATTCAAAAAT 1950 

CCACAACAAC CCCCATCAAC 2000 

AACTTCACAG TCTAGCTCAA 2050 

CCGATCCTAA CAATAATACG 2100 

AATCAGAATC CTCAACCAGC 2150 

2157 



2) INFORMATION FOR SEQ ID NO: 2141 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2141 
CACCAGTCAC AGAAAAGC 18 



2) INFORMATION FOR SEQ ID NO: 2142 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2142 
GCCTTAATTT CGGATAGTGC 20 



2) INFORMATION FOR SEQ ID NO: 2143 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 

GAGAAAGAGC CCAATTATCT AATGT 



2) INFORMATION FOR SEQ ID NO: 2144 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3075 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 

(B) STRAIN: J2870 

(C) ACCESSION NUMBER: X75439 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2144 

TTGACAAAGA AATATTTAAA CACCCAGAAT GAAATATCAG CATTTTGGAA 50 

TACTCAAAAG ATATTTAAAA AATCAATTGA CAATAGAAAA GGACAGGAAA 100 

GTTTTGTTTT TTATGACGGC CCCCCAACTG CAAATGGCCT TCCTCATGCT 150 

GGCCATGTTC TTGGAAGAGT AATCAAGGAT TTAGTTGCAA GATTAAAAAC 200 

TATGCAAGGT TTTTATGTAG AAAGAAAAGC AGGATGGGAT ACCCATGGCT 250 

TACCAGTTGA ATTAGAGGTT GAAAAAAAAA TTGGAATTAA AGGAAAACAA 3 00 

GACATTGAAA AGTATGGAAT AGAAAATTTT ATAAATGAAT GTAAAAAAAG 350 

TGTATTTAAT TATGAAAAAG AATGGCGGGA TTTTTCTAAA GATTTAGGAT 400 

ACTGGGTTGA CATGGACTCC CCCTATATAA CTCTTGAGAA TAATTATATT 450 

GAAAGTGTAT GGAATATATT ATCTACATTC CATAAAAAAG GACTATTATA 500 

TAAGGGACAT AAGGTGACTC CTTATTGTAC ACATGATCAA ACCGCTTTAA 550 

GTTCTCATGA AGTAGCGCAA GGCTATAAAA ACGTTAAAGA TTTATCAGCT 600 

GTTGTTAAAT TTCAACTTAC AAATAGTAAA GATACTTATT TCTTAAGTTG 650 

GACTACCACT CCCTGGACTT TGCCTGCAAA TGTAGCATTA GCTATAAATA 700 

AAGATCTTAA TTATTCAAAA ATTCGGGTAG AAAATGAGTA TTATATCTTA 750 

GCTACAGATC TAATTAATTC TATAATAACT GAAAAATACG AAATTATTGA 800 

TACCTTTTCA GGAAGTAATT TAATTAATTT AAAATACATT CCTCCTTTTG 850 

AAAGCGACGG TTTAGTTAAT GCATATTACG TTGTTGATGG AGAATTTGTT 900 

ACTAACTCAG AAGGAACTGG TATTGTTCAT ATAGCACCAG CTCATGGGGA 950 

AGATGACTAC CAATTGGTTT TAGAGCGTGA TTTGGATTTC TTAAATGTTA 1000 

TAACAAGAGA AGGAGTATAT AATGATAGGT TCCCTGAATT AGTTGGTAAT 1050 

AAAGCTAAAA ATAGTGATAT AGAAATCATA AAATTATTAT CCAAAAAACA 1100 

ACTTTTATAT AAAAAACAAA AATATGAGCA TAATTATCCT CATTGTTGGA 1150 

GATGTGGTAA TCCTTTGATA TATTATGCGA TGGAAGGTTG GTTTATTAAA 1200 

ACAACTAATT TTAAGAATGA AATTATTAAC AATAATAATA ATATAGAGTG 1250 

GTTTCCTTCT CATATTAAGG AAGGGAGAAT GGGAAATTTC TTAGAAAATA 1300 

TGGTTGATTG GAACATTGGT AGAAATAGAT ATTGGGGAAC ACCATTAAAT 1350 

GTATGGATTT GCAATGATTG TAATCACGAA TACGCACCAA GTAGTATTAA 1400 

GGATTTACAA AATAATTCCA TCAATAAAAT TGATGAAGAT ATTGAGTTGC 1450 

ATAGACCTTA TGTTGATAAT ATCACTCTTA GTTGCCCTAA GTGTAATGGG 1500 
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AAAATGTCTC GAGTAGAAGA AGTAATCGAT GTTTGGTTTG ATAGCGGCTC 1550 

TATGCCGTTT GCTCAGCATC ATTATCCTTT TGATAACCAG AAAATTTTTA 1600 

ATCAACACTT TCCAGCTGAT TTTATTGCAG AAGGAGTTGA TCAAACGAGA 1650 

GGCTGGTTTT ACAGTTTACT AGTAATTTCT ACTATTCTAA AAGGAAAATC 1700 

TTCTTATAAA CGTGCTTTAT CTTTAGGACA TATTCTAGAC AGTAATGGTA 1750 

AAAAAATGTC TAAAAGTAAA GGAAACGTTA TTAATCCAAC TGAATTAATT 1800 

AATAAGTACG GAGCCGATTC TTTAAGATGG GCCTTAATTT CGGATAGTGC 1850 

TCCATGGAAT AACAAAAGAT TCTCAGAAAA TATAGTAGCT CAGACCAAAT 1900 

CGAAATTTAT AGATACGCTT GATAATATTT ATAAATTTTA TAATATGTAT 1950 

AATAAAATAG ATCACTATAA TCCTAATAAT GAAATTACAA AAAGTAGAAA 2000 

TACATTAGAT AATTGGGCTC TTTCTCGCTT AAACACCTTA ATAAAAGAAA 2050 

GTAATATTTA TGTAAATAAT TACGATTTCA CTTCCGCAGC CAGATTAATT 2100 

AACGAATATA CCAATACAAT AAGTAATTGG TATATCGGAG ATTCGAGAGG 2150 

ACGATTTTGG GAACAAGGAA TTTCTAACGA TAAAAAAGAT GCGTACAATA 220 0 

CGCTTTATGA AATTTTAACA ACTTTATCAA GACTAGTGGC TCCATTTGTT 2250 

CCATTTATAT CTGAAAAAAT CCATTATAAT TTGACTGGAA AAAGTGTGCA 2300 

TTTACAAGAT TATCCACAAT ATAAAGAAAG TTTTATTAAT CAAGCATTGG 2350 

AAGATGAAAT GCATACCGTT ATAAAAATTG TAGAATTATC TAGACAGGCT 2400 

CGCAAAAATG CAGATTTAAA AATTAAGCAA CCTTTATCGA AAATGGTGAT 2450 

TAAACCTAAT AGTCAATTAA ACTTAAGTTT TTTACCTAAT TACTATTCAA 2500 

TAATAAAAGA CGAATf AAAT ATAAAAAACA TTGAATTAAC TGATAATATT 2550 

AATGACTATA TTACCTATGA GCTTAAATTG AATTTTTCTT CTGTGGGACC 2600 

AAAACTAGGG AACAAAACGA AAAATATTCA AACATTGATA GACTCCCTAT 2650 

CAGAGTATGA TAAAAAAAGT TTAATTGAGT CTAATAACTT CAAAAGTTTA 2700 

TCTTCTGATG CTGAGTTAAC TAAGGATGAT TTTATAATTA AAACCTTACC 2750 

TAAGGATAGT TATCAACTCA GTGAAGATAA TGACTGCGTT ATATTATTAG 2800 

ATAAAAATTT ATCTCCTGAA TTAATTCGCG AAGGACATGC TAGAGAGCTC 2850 

ATTAGATTAA TTCAACAATT AAGAAAAAAG AAAAATTTAC CAATAAATCA 2900 

ACGTATTGAT ATTTATATCG GTGTAACTGG GGAATTATTA GAATCAATAA 2950 

AAACCAATAA AAATATGTTT AAAGAAAATT TCGTGATTAA AAATATACAC 3000 

TTAAATGTTA TAGATGAATA TGAAAATACT ATTCATTTTA ATAATAAAGA 3 050 

AATAAAAATT TCCTTATTAT ATTAA 3075 



2) INFORMATION FOR SEQ ID NO: 2145 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2145 
GCAAGATGTG GCGTGTTACG GT 



2) INFORMATION FOR SEQ ID NO: 2146 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2146 
GGGGCGAAGA AGTTGTCCAT ATT 



2) INFORMATION FOR SEQ ID NO: 2147 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 660 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 
(C) ACCESSION NUMBER: V00622 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2147 

ATGGAGAAAA AAATCACTGG ATATACCACC GTTGATATAT CCCAATGGCA 50 

TCGTAAAGAA CATTTTGAGG CATTTCAGTC AGTTGCTCAA TGTACCTATA 100 

ACCAGACCGT TCAGCTGGAT ATTACGGCCT TTTTAAAGAC CGTAAAGAAA 150 

AATAAGCACA AGTTTTATCC GGCCTTTATT CACATTCTTG CCCGCCTGAT 200 

GAATGCTCAT CCGGAATTCC GTATGGCAAT GAAAGACGGT GAGCTGGTGA -250 

TATGGGATAG TGTTCACCCT TGTTACACCG TTTTCCATGA GCAAACTGAA 3 00 

ACGTTTTCAT CGCTCTGGAG TGAATACCAC GACGATTTCC GGCAGTTTCT 350 

ACACATATAT TCGCAAGATG TGGCGTGTTA CGGTGAAAAC CTGGCCTATT 400 

TCCCTAAAGG GTTTATTGAG AATATGTTTT TCGTCTCAGC CAATCCCTGG 450 

GTGAGTTTCA CCAGTTTTGA TTTAAACGTG GCCAATATGG ACAACTTCTT 500 

CGCCCCCGTT TTCACCATGG GCAAATATTA TACGCAAGGC GACAAGGTGC 550 

TGATGCCGCT GGCGATTCAG GTTCATCATG CCGTCTGTGA TGGCTTCCAT 600 

GTCGGCAGAA TGCTTAATGA ATTACAACAG TACTGCGATG AGTGGCAGGG 650 

CGGGGCGTAA 660 



2) INFORMATION FOR SEQ ID NO: 2148 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2148 
CAGATTAAAT GCGGATTCAG CC 
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2) INFORMATION FOR SEQ ID NO: 2149 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2149 
ATCAGGTAAA TCATCAGCGG ATA 



2) INFORMATION FOR SEQ ID NO: 2150 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia . coli 

(B) STRAIN: K12 

(C) ACCESSION NUMBER: X53796 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2150 

ATGAATTTTA CCCGGATTGA CCTGAATACC TGGAATCGCA GGGAACACTT 50 

TGCCCTTTAT CGTCAGCAGA TTAAATGCGG ATTCAGCCTG ACCACCAAAC 100 

TCGATATTAC CGCTTTGCGT ACCGCACTGG CGGAGACAGG TTATAAGTTT 150 

TATCCGCTGA TGATTTACCT GATCTCCCGG GCTGTTAATC AGTTTCCGGA 200 

GTTCCGGATG- GCACTGAAAG ACAATGAACT TATTTACTGG GAC CAGTCAG 250 

ACCCGGTCTT TACTGTCTTT CATAAAGAAA CCGAAACATT CTCTGCACTG 3 00 

TCCTGCCGTT ATTTTCCGGA TCTCAGTGAG TTTATGGCAG GTTATAATGC 3 50 

GGTAACGGCA GAATATCAGC ATGATACCAG ATTGTTTCCG CAGGGAAATT 400 

TACCGGAGAA TCACCTGAAT ATATCATCAT TACCGTGGGT GAGTTTTGAC 450 

GGATTTAACC TGAACATCAC CGGAAATGAT GATTATTTTG CCCCGGTTTT 500 

TACGATGGCA AAGTTTCAGC AGGAAGGTGA CCGCGTATTA TTACCTGTTT 550 

CTGTACAGGT TCATCATGCA GTATGTGATG GCTTTCATGC AGCACGGTTT 600 

ATTAATACAC TTCAGCTGAT GTGTGATAAC ATACTGAAAT AA 642 



2) INFORMATION FOR SEQ ID NO: 2151 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2151 
ATATTTCAGC ATTACCTTGG GTT 23 



2) INFORMATION FOR SEQ ID NO: 2152 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2152 
TACACAACTC TTGTAGCCGA TTA 23 



2) INFORMATION FOR SEQ ID NO: 2153 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: • 642 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Shigella flexineri 
(C) ACCESSION NUMBER: X07848 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2153 

ATGAACTATA CAAAATTTGA TGTAAAAAAT TGGGTTCGCC GTGAGCATTT 50 

TGAGTTTTAT CGGCATCGTT TACCATGTGG TTTTAGCTTA ACAAGCAAAA 100 

TTGATATCAC GACGTTAAAA AAGTCATTGG ATGATTCAGC GTATAAGTTT 150 

TATCCGGTAA TGATCTATCT GATTGCTCAG GCCGTGAATC AATTTGATGA 2 00 

GTTGAGAATG GCGATAAAAG ATGATGAATT GATCGTATGG GATTCAGTCG 250 

ACCCACAATT CACCGTATTC CAT C AAG AAA CAGAGACATT TTCAGCACTG 3 00 

AGTTGCCCAT ACTCATCCGA TATTGATCAA TTTATGGTGA ATTATTTATC 350 

GGTAATGGAA CGTTATAAAA GTGATACCAA GTTATTTCCT CAAGGGGTAA 400 

CACCAGAAAA TCATTTAAAT ATTTCAGCAT TACCTTGGGT TAATTTTGAT 450 

AGCTTTAATT TAAATGTTGC TAATTTTACC GATTATTTTG CACCCATTAT 500 

AACAATGGCA AAATATCAGC AAGAAGGGGA TAGACTGTTA TTGCCGCTCT 550 

CAGTACAGGT TCATCATGCA GTTTGTGATG GCTTCCATGT TGCACGCTTT 600 

ATTAATCGGC TACAAGAGTT GTGTAACAGT AAATTAAAAT AA 642 
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2) INFORMATION FOR SEQ ID NO: 2154 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2154 
CGCCATTCAG AGTTTAGGAC 20 



2) INFORMATION FOR SEQ ID NO: 2155 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2155 
TTCCATACCG TTGCGTATCA CTT 23 



2) INFORMATION FOR SEQ ID NO: 2156 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 624 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium perfringens 

(B) STRAIN: CP590 

(C) ACCESSION NUMBER: M74769 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2156 

ATGGTATTTG AAAAAATTGA TAAAAATAGT TGGAACAGAA AAGAGTATTT 50 

TGACCACTAC TTTGCAAGTG TACCTTGTAC ATACAGCATG ACCGTTAAAG 100 

TGGATATCAC ACAAATAAAG GAAAAGGGAA TGAAACTATA TCCTGCAATG 150 

CTTTATTATA TTGCAATGAT TGTAAACCGC CATTCAGAGT TTAGGACGGC 200 

AATCAATCAA GATGGTGAAT TGGGGATATA TGATGAGATG ATACCAAGCT 250 
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ATACAATATT TCACAATGAT ACTGAAACAT 
TGTAAGTCTG ACTTTAAATC ATTTTTAGCA 
ACGGTATGGA AACAATCATA GAATGGAAGG 
ACATTTTTAA TGTATCTATG ATACCGTGGT 
CTGAATTTGC AGAAAGGATA TGATTATTTG 
GAAATATTAT AAAGAAGATA ACAAAATTAT 
TTCATCACGC AGTATGTGAC GGATTTCACA 
TTGCAGGAAT TGATAAATAG TTAA 
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TTTCCAGCCT TTGGACTGAG 300 

GATTATGAAA GTGATACGCA 350 

AAAGCCAAAT GCTCCGGAAA 4 00 

CAACCTTCGA TGGCTTTAAT , 450 

ATTCCTATTT TTACTATGGG 500 

ACTTCCTTTG GCAATTCAAG 550 

TTTGCCGTTT TGTAAACGAA 600 

624 



2) INFORMATION FOR SEQ ID NO: 2157 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2157 
CCACAGAAAT TGATATTAGT GTTTTAT 27 



2) INFORMATION FOR SEQ ID NO: 2158 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2158 
TCGCTATTGT AACCAGTTCT A 21 



2) INFORMATION FOR SEQ ID NO: 2159 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 651 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: V01277 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2159 



ATGAACTTTA ATAAAATTGA TTTAGACAAT 
TAATCATTAT TTGAACCAAC AAACGACTTT 
ATATTAGTGT TTTATACCGA AACATAAAAC 
CCTGCATTTA TTTTCTTAGT GACAAGGGTG 
TAGAACTGGT TACAATAGCG ACGGAGAGTT 
AGCCACTTTA TACAATTTTT GATGGTGTAT 
TGGACTCCTG TAAAGAATGA CTTCAAAGAG 
TGATGTAGAG AAATATAATG GTTCGGGGAA 
TACCTGAAAA TGCTTTTTCT CTTTCTATTA 
GGGTTTAACT TAAATATCAA TAATAATAGT 
TACAGCAGGA AAATT CATTA ATAAAGGTAA 
CTTTACAGGT ACATCATTCT GTTTGTGATG 
ATGAACTCTA TTCAGGAATT GTCAGATAGG 
A 



TGGAAGAGAA AAGAGATATT 50 

TAGTATAACC ACAGAAATTG 100 

AAGAAGGATA TAAATTTTAC 150 

ATAAACTCAA ATACAGCTTT 200 

AGGTTATTGG GATAAGTTAG 250 

CTAAAACATT CTCTGGTATT 300 

TTTTATGATT TATACCTTTC 350 

ATTGTTTCCC AAAACACCTA 400 

TTCCATGGAC TTCATTTACT 450 

AATTACCTTC TACCCATTAT 500 

TTCAATATAT TTACCGCTAT 550 

GTTATCATGC AGGATTGTTT 6 00 

CCTAATGACT GGCTTTTATA 650 

651 



2) INFORMATION FOR SEQ ID NO: 2160 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2160 
TTTTGAACAC TATTTTAACC AGC 23 



2) INFORMATION FOR SEQ ID NO: 2161 

(i) SEQUENCE CHARACTERISTICS: 

(A) ' LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2161 
GATTTAACTT ATCCCAATAA CCT 23 



2) INFORMATION FOR SEQ ID NO: 2162 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 648 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
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(D) TOPOLOGY: Linear 
(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus aureus 
(C) ACCESSION NUMBER: X02166 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2162 



ATGACTTTTA ATATTATCAA ATTAGAAAAT 
TGAACACTAT TTTAACCAGC AAACTACGTA 
ATATTACTTT GTTTAAAGAT ATGATAAAAA 
CCTTCTTTGA TTTATGCAAT TATGGAAGTT 
TAGAACAGGA ATTAATAGTG AGAATAAATT 
ATCCTTTGTA TACAGTTTTT AATAAGCAAA 
TGGACTGAAT CTGATAACAA CTTCACTTCT 
TGACTTGCTT GAATATAAAG ATAAAGAAGA 
TACCTGAAAA CACCATACCG ATTTCAATGA 
TCATTTAATT TAAACATTGG TAACAATAGC 
TACGATAGGT AAATTTTATA GTGAGAATAA 
CTTTGCAGCT TCATCATGCT GTATGTGATG 
ATGAATGAAT TTCAAGATAT AATTCATAAG 



TGGGATAGAA AAGAATATTT 50 

TAGCATTACT AAAGAAATTG 100 

AGAAAGGATA TGAAATTTAT 150 

GTAAATAAAA ATAAAGTGTT 200 

AGGTTATTGG GATAAGTTAA 250 

CTGAAAAATT TACTAACATT 3 00 

TTTTATAATA ATTATAAAAA 350 

AATGTTTCCT AAAAAACCGA 4 00 

TTCCTTGGAT TGATTTTAGT 450 

AACTTTTTAT TGCCTATTAT 500 

TAAAATTTAT ATACCAGTTG 550 

GTTACCATGC TTCATTATTT 600 

GTAGATGATT GGATTTAG 648 



2) INFORMATION FOR SEQ ID NO: 2163 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2163 
ACCTTCATCC TACCGATGTG GGTT 24 



2) INFORMATION FOR SEQ ID NO: 2164 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2164 
CAACGACACC AGCACTGCCA TTG 23 



2) INFORMATION FOR SEQ ID NO: 2165 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1215 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella typhimurium 

(B) STRAIN: H3380 

(C) ACCESSION NUMBER: AF071555 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2165 

ATGACCACCA CACGCCCCGC GTGGGCCTAT ACGCTGCCGG CAGCACTGCT 50 

GCTGATGGCT CCTTTCGACA TCCTCGCTTC ACTGGCGATG GATATTTATC 100 

TCCCTGTCGT TCCAGCGATG CCCGGCATCC TGAACACGAC GCCCGCTATG 150 

ATCCAACTCA CGTTGAGCCT CTATATGGTG ATGCTCGGCG TGGGCCAAGT 200 

GATTTTTGGT CCGCTCTCAG ACAGAATCGG GCGACGGCCA ATTCTACTTG 250 

CGGGCGCAAC GGCTTTCGTC ATTGCGTCTC TGGGAGCAGC TTGGTCTTCA 300 

ACTGCACCGG CCTTTGTCGC TTTCCGTCTA CTTCAAGCAG TGGGCGCGTC 350 

GGCCATGCTG GTGGCGACGT TCGCGACGGT TCGCGACGTT TATGCCAACC 400 

GTCCTGAGGG TGTCGTCATC TACGGCCTTT TCAGTTCGAT GCTGGCGTTC 450 

GTGCCTGCGC TCGGCCCTAT CGCCGGAGCA TTGATCGGCG AGTTCTTGGG 500 

ATGGCAGGCG ATATTCATTA CTTTGGCTAT ACTGGCGATG CTCGCACTCC 550 

TAAATGCGGG TTTCAGGTGG CACGAAACCC GCCCTCTGGA TCAAGTCAAG 600 

ACGCGCCGAT CTGTCTTGCC GATCTTCGCG AGTCCGGCTT TTTGGGTTTA 650 

CACTGTCGGC TTTAGCGCCG GTATGGGCAC CTTCTTCGTC TTCTTCTCGA 700 

CGGCTCCCCG TGTGCTCATA GGCCAAGCGG AATATTCCGA GATCGGATTC 750 

AGCTTTGCCT TCGCCACTGT CGCGCTTGTA ATGATCGTGA CAACCCGTTT 800 

CGCGAAGTCC TTTGTCGTCA GATGGGGCAT CGCAGGATGC GTGGCGCGTG 850 

GGATGGCGTT GCTTGTTTGC GGAGCGGTCC TGTTGGGGAT CGGCGAACTT 900 

TACGGCTCGC CGTCATTCCT CACCTTCATC CTACCGATGT GGGTTGTCGC 950 

GGTCGGTATT GTCTTCACGG TGTCCGTTAC CGCGAACGGC GCTTTGGCAG 1000 

AGTTCGACGA CATCGCGGGA TCAGCGGTCG CGTTCTACTT CTGCATCCAA 1050 

AGCCTGATAG TCAGTATCGT CGGGACATTG GCGGTGACGC TGTTAAACGG 1100 

CGATACAGCG TGGCCCGTGA TTTGTTACGC CACGGCAATG GCAGTGCTGG 1150 

TGTCGTTGGG GCTGGCGCTC CTTCGATCCC GTGATGCTGC CACCGAGAAG 1200 

TCGCCAGTCG TCTAG 1215 



2) INFORMATION FOR SEQ ID NO: 2166 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2166 
GACAAACCAT TCCTGCTG 
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2) INFORMATION FOR SEQ ID NO: 2167 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2167 
CAGCAGCTGG GCGGCGGT 18 



2) INFORMATION FOR SEQ ID NO: 2168 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: • DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2168 
CATCAAAGTT GGTGAAGAAG TTG 23 



2) INFORMATION FOR SEQ ID NO: 2169 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2169 
CCCGTTTGCG AAAGGTGG 18 



2) INFORMATION FOR SEQ ID NO: 2170 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 
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(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2170 
ACGTGACGTT GACAAACCA 19 



2) INFORMATION FOR SEQ ID NO: 2171 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2171 
TCGTTGGATT AACTGAAGAA 20 
2) INFORMATION FOR SEQ ID NO: 2172 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2172 
GTGTTGAAAT GTTCCGTAAA 20 



2) INFORMATION FOR SEQ ID NO: 2173 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2173 
AAGAAAAAAT CTTCGAACTG GCTA 24 



2) INFORMATION FOR SEQ ID NO: 2174 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2174 
TCTACACGGC CGGTG 



2) INFORMATION FOR SEQ ID NO: 2175 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2175 
CCGCCATACC CCGTTT 



2) INFORMATION FOR SEQ ID NO: 2176 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217 6 
CGGCATTACC ATTTCCACAC CTTT 



2) INFORMATION FOR SEQ ID NO: 2177 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 49 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2177 

GGCACGGACA AACCATTCCT GCTGCCTATC GAAGACGTGT TCCCGTGCC 49 



2) INFORMATION FOR SEQ ID NO: 2178 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:. 2178 
GGCACGACAA ACCATTCCTG CTGCCTATCG AACGTGCC 38 



2) INFORMATION FOR SEQ ID NO: 2179 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 39 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2179 
GGCAGCTCTA CTTCCGTACC ACTGACGTAA CCGGCTGCC 39 



2) INFORMATION FOR SEQ ID NO: 2180 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2180 
TTCGCCGGCG TGGGC 15 



2) INFORMATION FOR SEQ ID NO: 2181 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 15 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2181 
AGCGCCACGC GCAGG 



2) INFORMATION FOR SEQ ID NO: 2182 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2182 
GCGCGCCAAC GACTTCTACC ACGAAATGGA AGAGTCGCGC GC 
2) INFORMATION FOR SEQ ID NO: 2183 

(i) SEQUENCE CHARACTERISTICS: 

(A) ■ LENGTH: 817 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Alcaligenes faecalis subsp. faecalis 

(B) STRAIN: ATCC 8750 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2183 

TATCTTGGTT WGCTCGGCCG CTGACGGCCC AATGCCTCAG ACTCGCGAGC 50 

ACATCCTGCT GAGCCGTCAG GTTGGCGTTC CTTACATCAT CGTGTTCCTG 100 

AACAAGGCCG ACATGGTTGA TGACGAAGAG CTGATCGAAC TGGTTGAAAT 150 

GGAAGTTCGC GAGCTGTTGT CCAAGTACGA CTTCCCTGGC GACGACACCC 2 00 

CGATCATCAA GGGTTCGGCC AAACTGGCTC TGGAAGGCGA CGAAGGCCCA 250 

CTGGGCAGCC AAGCCGTTCT GGCTCTGGCC GAAGCGCTGG ACAACTACAT 3 00 

TCCTACGCCT GAGCGTGCCG TTGACGGTAC GTTCCTGATG CCTGTTGAAG 3 50 

ACGTGTTCTC GATCTCCGGC CGTGGTACGG TTGTGACCGG TCGTATTGAG 4 00 

CGCGGCATCA TCAAGGTCGG CGAAGAAATC GAAATCGTGG GTATCAAAGA 4 50 

CACGGTCAAG ACCATTTGTA CCGGCGTTGA AATGTTCCGC AAACTGCTGG 500 

ACCAGGGCGA AGCTGGCGAT AACGTCGGTC TGCTGCTGCG TGGTACCAAG 550 

CGTGAAGACG TGGAACGTGG TCAAGTTCTG GCCAAGCCAG GCTCGATCAA- 600 

GCCACACACT GACTTCGACG CCGAGGTGTA CATTCTGTCC AAAGAAGAAG 650 

GTGGTCGTCA CACTCCTTTC TTCAAGGGCT ACCGTCCTCA GTTCTACTTC 700 
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CGTACAACTG ACGTGACCGG CACCATCGAG CTGCCAGAAG ACAAGGAAAT 750 
GGTTCTGCCA GGCGACAACA TTTCGATGAA AGTGTCCCTG ATCGCTCCTA 800 
TCGCCATGGA AGAAGGT 817 



2) INFORMATION FOR SEQ ID NO: 2184 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1652 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter coli 

(B) STRAIN: ATCC 43479 



(xi) SEQUENCE DESCRIPTION: SEQ 

TACAATTGAA GTTGAAAGAT CTATGCGTGT 
TATTTTGTTC AGTGGGTGGG GTGCAGCCTC 
CAAGCAAATA AATATGGTGT TCCAAGAATA 
TAGAATCGGT GCAAATTTCT ACAATGTAGA 
TAAAAGCTAA TCCAGTTCCA CTTCAAATTC 
TTTAAAGGCG TAATCGATCT TGTAACTATG 
TGATACTAAG CCAACGGATT ATGTAGAAAA 
AAGAAAAGGC AGAAGAATAT CGCACAAAAA 
ACTTCAGATG AGTTGATGGA AAAATATTTA 
TGAAGAGATT AAAACAGGGA TTAAAGCAGG 
TTCCTATGCT TTGCGGTACA GCGTTTAAAA 
CTTGATGCTG TTGTGGCTTA TTTACCAGCT 
CAAAGGGGAA TATGAAGACG GCACAGAAGT 
ATGGCGAATT TGCAGGACTT GCATTTAAAA 
GGACAACTTA CTTTCGTGCG TGTTTATCGT 
TTATGCTTAT AACTCAACCA AAGATAAAAA 
TAAAAATGCA CTCTAATAAA AGAGAAGAAA 
GAAATCGGTG CAGTTGTAGG ACTTAAAGAT 
TGCAAGTGAA AAAGATAAAG TAATTCTTGA 
CAGTTATTTC TGTTGCAGTA GAMCCAAAAA 
ATGTCTATTG CACTAAATAA ATTAGCTCAA 
TTCTACGGAT GAAGAAAGTG GTCAAACTAT 
TACACCTTGA AATTATTGTT GATCGTATGC 
GCTGAAGTGG GACAACCTCA AGTTGCTTAT 
TGTTGAGCAA GAATACAAAT ACGCTAAGCA 
ATGGACATGT ATTCTTACGC CTTGAGCCAC 
GAGTTTGTTA ACGACATCAA AGGTGGGGTA 
TGCAGTAGAT AAGGGTGTTC AAGAAGCATT 
GTTATCCAGT AGAAGATGTT AAAGTAACTG 
GAGGTGGATT CATCTGAGAT GGCATTTAAA 
TAAAGAGGGT GCTAGAAAAG CAGGTGCTGT 
AAGTTGAAGT AGAAACTCCT GAAGATTACA 
CTTAATAAGC GTCGTGGTCA AGTAAATAGC 
TA 



ID NO: 2184 



TCTTGATGGT 


GCTGTTGCGG 


50 


AAAGTGAAAC 


*bl ^*bi bbbbI BBBBh bbbbb ^*Bk bbl ^bh BBk 

AGTTTGGAGA 


100 


GTATTTGTAA 


ATAAAATGGA 


150 


AGATCAAATT 


CGCAACCGTT 


200 


CAATCGGTGC 


TGAGGATAAT 


250 


AAAGCTTTAG 


TTTGGGAAGA 


300 


AGAAATTCCA 


GCTGAACTTA 


350 


TGATAGAAGC 


AGTTTCTGAA 


400 


GGTGGAGAAG 


AATTAAGCCT 


450 


ATGTTTAAGT 


CTTTCTATCG 


500 


ATAAAGGGGT 


TCAACCTTTG 


550 


CCTGATGAAG 


TTGCTAATAT 


600 


TTCTGTAAAA 


TCAACTGATG 


650 


TTATGACAGA 


TCCATTTGTA 


700 


GGATGTTTAG 


AAAGCGGTTC 


750 


AGAAAGAATT 


GGTCGTTTGT 


800 


TTAAAGTTCT 


TTACGCAGGA 


850 


ACTTTAACAG 


GGGATACTCT 


900 


AAGAATGGAT 


TTCCCAGACC 


950 


CTAAAGCAGA 


TCAAGAAAAA 


1000 


GAAGATC CAA 


GCTTTAGAGT . 


1050 


CATTTCAGGT 


ATGGGTGAAT 


1100 


TTCGTGAATT 


TAAAGTGGAA 


1150 


CGTGAAACTA 


TCAGAAAAAC 


1200 


ATCAGGCGGT 


CGTGGTCAGT 


1250 


TTGAGCCAGG 


AAGTGGATAC 


1300 


ATT C CAAAAG 


AATATATTCC 


1350 


GCAAAATGGT 


GTTTTAGCAG 


1400 


TTTATGATGG 


AAGTTATCAC 


1450 


CTTGCTGCTT 


CTATGGGATT 


1500 


GATCTTAGAG 


CCTATGATGA 


1550 


TGGGCGATGT 


TATTGGCGAT 


1600 


ATGGATGAAA 


GAGGTGGTAA 


1650 






1652 
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2) INFORMATION FOR SEQ ID NO: 2185 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 820 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 
"(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Succinivibrio dextrinosolvens 

(B) STRAIN: ATCC 19716 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2185 

GCTATTCTAG TAGTAGCAGC AACTGATGGT CCTATGCCAC AGACCCGTGA 50 

GCACATCCTA TTAGCACGTC AGGTAGGCGT ACCATACATC ATCGTATTCC 100 

TAAACAAGTG CGATATGGTT GACGACGAGG AATTATTAGA GTTAGTTGAG 150 

ATGGACGTAC GTGATCTATT AAATCAGTAC CAGTTCCCAG GCGACGACAC 200 

TCCAATCATC CGTGGTTCAG CACTAGGTGC ATTAAACGGC GAAGAGAAGT 250 

GGAAAGAGGC AATCTATCAG TTAGCAGACA CTCTAGATTC ATACATTCCA 300 

GAGCCAAAGC GTGATATCGA TGATC CATTC CTATTACCAA TCGAAGATAT 3 50 

CTTCTCAATC TCAGGTCGTG GTACTGTAGT AACCGGCCGT GTAGAGCGTG 400 

GTATTGTACA CGTAGGTGAC GAAGTTGAAA , TCGTTGGTAT TCGTCCAACC 450 

ACCAAGACCA CTGTAACTGG CGTTGAAATG TT.CCGTAAGT TACTAGACGA 500 

AGGTCGTGCA GGTGATAACG TTGGTGTTCT ACTACGTGGT ACCAAGCGTG 550 

ATGAGGTTGA GCGTGGTCAG GTTCTAGCTG CTCCAGGCAC AATCACTCCA 6 00 

CACACCAAGT TCACTGGTCA GGTTTACGTA CTAAGCAAGG ATGAAGGTGG 650 

TCGTCACACT CCATTCTTCA AGGGCTACCG TCCACAGTTC TTCTTCCGTA 700 

CAACCGATAT TACCGGTTCT ATCGATCTGA AAGAGGGCGT AGAGATGGTA 750 

ATGCCAGGTG ATAACACCGA CATGACCGTA ACCCTAATCC ACCCAGTAGC 800 

TATGGCTGAA GGCGAGAGAT 820 



2) INFORMATION FOR SEQ ID NO: 2186 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2186 
ACGCGCTCAA AGCAGAAGTA TACGTATTAT CAAAAGACGC GCGT 



2) INFORMATION FOR SEQ ID NO: 2187 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1612 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni subsp. jejuni 

(B) STRAIN: ATCC 33292 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2187 

CAGTGGGTGG GGTGCAGCCT CAAAGTGAAA CAGTTTGGAG ACAAGCAAAT ' 50 

AAATATGGTG TTCCAAGAAT AGTATTTGTA AACAAAATGG ATAGAATCGG 100 

TGCAAATTTC TACAATGTAG AAGATCAAAT TCGCAACCGT TTAAAAGCTA 150 

ATCCAGTTCC ACTTCAAATT CCAATCGGTG CTGAGGATAA TTTTAAAGGC 200 

GTAATCGATC TTGTAACTAT GAAAGCTTTA GTTTGGGAAG ATGATACTAA 250 

GCCAACGGAT TATGTAGAAA AAGAAATTCC AGCTGAACTT AAAGAAAAGG 300 

CAGAAGAATA TCGCACAAAA ATGATAGAAG CAGTTTCTGA AACTTCAGAT 350 

GAGTTGATGG AAAAATATTT AGGCGGAGAA GAATTAAGCC TTGAAGAGAT 400 

TAAAACAGGG ATTAAAGCAG GATGTTTAAG TCTTTCTATC GTTCCTATGC 450 

TTTGCGGTAC AGCGTTTAAA AATAAAGGGG TTCAACCTTT ACTTGATGCT 500 

GTTGTGGCTT ATTTACCAGC TCCTGATGAA GTGGCAAATA TCAAGGGTGA 550 

ATATGAAGAT GGCACAGAAG TTTCTGTAAA ATCAACTGAT GATGGCGAGT 600 

TTGCAGGACT TGCATTTAAA ATTATGACAG ATCCATTTGT AGGACAACTT 650 

ACTTTCGTGC GTGTTTATCG CGGTTGTTTA GAAAGTGGTT CTTATGCTTA 700 

TAACTCAACT AAAGATAAAA AAGAAAGAAT TGGTCGTTTG TTAAAAATGC 750 

ACTCTAACCA AAGAGAAGAG ATTAAAGTGC TTTACGCAGG CGAAATTGGT 800 

GCTGTTGTAG GACTTAAAGA TACTTTAACA GGGGATACTC TTGTAAGTGA 850 

AAAAGATAAG GTAATCCTTG AAAGAATGGA TTTTCCAGAT CCAGTTATTT 900 

CTGTTGCAGT TGAGCCAAAA ACTAAAGCAG ATCAAGAAAA AATGTCTATT 950 

GCTTTAAATA AATTAGCACA AGAAGATCCA AGTTTTAGAG TTTCTACAGA 1000 

TGAAGAAAGT GGCCAAACTA TCATTTCAGG TATGGGTGAG TTACACCTTG 1050 

AAATTATCGT TGATAGAATG CTTCGTGAAT TTAAAGTTGA AGCTGAAGTA 1100 

GGTCAACCAC AAGTTGCTTA TCGCGAAACT ATTAGAAAAA CTGTTGAACA 1150 

AGAATACAAA TACGCTAAAC AATCAGGTGG TCGTGGTCAG TATGGACATG 1200 

TATTCTTACG CCTTGAACCA CTTGAGCCAG GTAGTGGATA TGAATTTGTT 1250 

AATGATATCA AAGGTGGAGT AATTCCAAAA GAATACATTC CTGCAGTTGA 13 00 

TAAAGGTGTT CAAGAAGCAT TACAAAATGG TGTTTTAGCA GGTTATCCTG 1350 

TGGAAGATGT TAAAGTAACT GTTTATGATG GAAGTTATCA CGAGGTGGAT 1400 

TCATCTGAGA TGGCGTTTAA ACTTGCTGCT TCTATGGGCT TTAAAGAAGG 1450 

TGCTAGAAAA GCAGGTGCTG TGATCTTAGA GCCTATGATG AAAGTTGAAG 1500 

TAGAAACTCC TGAAGATTAC ATGGGTGATG TTATTGGAGA TCTTAACAAA 1550 

CGCCGTGGTC AAGTAAATAG CATGGATGAG CGTGGTGGAA ATAAAATCAT 1600 

CACAGCATTT TG 1612 



2) INFORMATION FOR SEQ ID NO: 2188 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1667 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Campylobacter jejuni subsp. jejuni 

(B) STRAIN: ATCC 33560 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2188 



ACAATTGAAG TTGAAAGATC 
ATTTTGTTCA GTGGGTGGGG 
AAGCAAATAA ATATGGTGTT 
AGAATCGGTG CAAATTTCTA 
AAAAGCTAAT CCAGTTCCAC 
TTAAAGGCGT AATCGATCTT 
GATACTAAGC CAACGGATTA 
AGAAAAGGCA GAAGAATATC 
CTTCAGATGA GTTGATGGAA 
GAAGAGATTA AAACAGGGAT 
TCCTATGCTT TGCGGTACAG 
TTGATGCTGT TGTGGCTTAT 
AAGGGTGAAT ATGAAGATGG 
TGGCGAGTTT GCAGGACTTG 
GACAACTTAC TTTCGTGCGT 
TATGCGTATA ACTCAACTAA 
AAAAATGCAC TCTAACAAAA 
AAATTGGTGC TGTTGTAGGA 
GCAAGTGAAA AAGATAAGGT 
AGTTATTTCT GTTGCAGTTG 
TGTCTATTGC TTTAAATAAA 
TCTACAGATG AAGAAAGTGG 
ACACCTTGAA ATTATCGTTG 
CTGAAGTAGG TCAACCACAA 
GTTGAACAAG AATACAAATA 
TGGACATGTA TTCTTACGCC 
AATTTGTTAA TGATAT CAAA 
GCAGTTGATA AAGGTGTTCA 
TTATCCTGTG GAAGATGTTA 
AGGTGGATTC ATCTGAGATG 
AAAGAAGGTG CTAGAAAAGC 
AGTTGAAGTA GAAACTCCTG 
TTAACAAACG CCGTGGTCAA 
AAAAT CAT C A CAGCATT 



TATGCGTGTT CTTGATGGTG 
TGCAGCCTCA AAGTGAAACA 
CCAAGAATAG TATTTGTAAA 
CAATGTAGAA GATCAAATTC 
TTCAAATTCC AATTGGTGCT 
GTAACTATGA AAGCTTTAGT 
TGTAGAAAAA GAAATTCCAG 
GCACAAAAAT GATAGAAGCA 
AAATATTTAG GCGGAGAAGA 
TAAAGCAGGA TGTTTAAGTC 
CGTTTAAAAA TAAAGGGGTT 
TTACCAGCTC CTGATGAAGT 
CACAGAAGTT TCTGTAAAAT 
CATTTAAAAT CATGACAGAT 
GTTTATCGCG GTTGTTTAGA 
AGATAAAAAA GAAAGAATTG 
GAGAAGAGAT TAAAGTGCTT 
CTTAAAGATA CTTTAACAGG 
AATCCTTGAA AGAATGGATT 
AGCCAAAAAC TAAAGCTGAT 
TTAGCACAAG AAGATCCAAG 
TCAAACTATC ATTTCAGGTA 
ATAGAATGCT TCGTGAATTT 
GTTGCTTATC GCGAAACTAT 
CGCTAAACAA TCAGGTGGTC 
TTGAACCACT TGAGCCAGGT 
GGTGGGGTAA TTCCAAAAGA 
AGAAGCATTA CAAAATGGTG 
AAGTAACTGT TTATGATGGA 
GCGTTTAAAC TTGCTGCTTC 
AGGCGCTGTG ATCTTAGAGC 
AAGATTATAT GGGTGATGTT 
GTAAATAGCA TGGATGAGCG 



*n ^^^k 

CTGTTGCGGT 


50 


GTTTGGAGAC 


100 


TAAAATGGAT 


150 


GCAACCGTTT 


200 


GAGGATAATT 


250 


TTGGGAAGAT 


300 


CTGAACTTAA 


350 


GTTTCTGAAA 


400 


ATTAAGCCTT 


450 


TTTCTATCGT 


500 


CAACCTTTGC 


550 


GGCAAATATC 


600 


CAACTGATGA 


650 


CCATTTGTAG 


700 


AAGCGGTTCT 


750 


GTCGTTTGTT 


800 


TACGCAGGCG 


850 


GGATACTCTT 


900 


TTCCAGATCC 


950 


CAAGAAAAAA 


1000 


TTTTAGAGTT 


1050 


TGGGTGAGTT 


1100 


AAAGTTGAAG 


1150 


TAGAAAAACT 


1200 


GTGGTCAGTA 


1250 


AGTGGATATG 


1300 


ATACATTCCT 


1350 


TTTTAGCAGG 


1400 


AGTTATCACG 


1450 


TATGGGCTTT 


1500 


CTATGATGAA 


1550 


ATTGGAGATC 


1600 


TGGTGGAAAT 


1650 




1667 



2) INFORMATION FOR SEQ ID NO: 2189 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1255 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Leishmania guyanensis 

(B) STRAIN: ATCC 50126 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 2189 

TGACGGCGCT GGACGTGACG GAGGACCTCG GCCGCGATGA GCCGCTGACG 50 

CTGGAGATCG TGCAGCACTT GGATGCGAAC ACCGGCCGCT GCATTGCGAT 100 

GCAGACGACG GACCTGCTGA AGCTGAAGTC GAAGGTTGTG TCGACCGGCG 150 

GCAACATCTC CGTGCCGGTG GGCCGCGAGA CACTGGGCCG CATCTTCAAC 200 

GTGCTGGGCG ACGCGATTGA CCACCGCGGC CCCGTGTGCG AGAAGATGCG 250 

CATGGCGATC CACGCCGAGG CGCCGAAGCT GGCGGACCAG GCTGCGGAGG 300 

ACACGATCCT GACGACCGGC ATCAAGGTGA TCGACCTGAT TCTGCCCTAC 350 

TGCAAGGGCG GCAAGATCGG CCTGTTCGGC GGTGCCGGTG TGGGCAAGAC 400 

TGTGATCATC ATGGAGCTGA TCAACAACGT CGCGAAGGGG CACGGCGGCT 450 

TCTCCGTGTT CGCCGGCGTT GGCGAGCGCA CGCGCGAGGG CACGGACCTG 500 

TACCTGGAGA TGATGCAGTC AAAGGTGATT GACCTGAAGG GCGAGTCGAA 550 

GTGCGTGCTT GTGTACGGGC AGATGAACGA GCCCCCGGGT GCGCGCGCGC 600 

GCGTTGCGCA GTCTGCGCTG ACGATGGCCG AGTACTTCCG CGACGTGGAG 650 

GGCCAGAACG TGCTGCTGTT CATCGACAAC ATCTTCCGCT TCACGCAGGC 700 

GAACTCCGAG GTGTCTGCGC TGCTGGGCCG CATCCCGGCC GCCGTGGGTT 750 

ACCAGCCGAC GCTTGCGGAG GATCTTGGCA TGCTGCAGGA GCGCATTACG 800 

TCGACGACGA AGGGATCGAT TACGTCTGTG CAGGCTGTGT ACGTGCCTGC 850 

GGATGATATC ACGGACCCCG CGCCCGCGAC GACGTTCTCG CACCTGGACG 900 

CGACGACTGT GCTGGACCGC GCGGTGGCGG AGTCGGGCAT CTACCCTGCC 950 

GTGAACCCGC TGGAGTGCGC GTCGCGCATC ATGGACCCCG ATGTGATCGA 1000 

CGTGGACCAT TACAACGTTG CACAGGATAT CGTCCAGATG CTGACCAAGT 1050 

ACAAGGAGCT GCAGGACATC ATTGCGGTGC TTGGCATCGA CGAGCTGAG C 1100 
GAGGAGGACA AGGTCGTGGT GGACCGCGCG CGCAAGGTGA CGCGGTTCCT ■ . 1150 

GTCGCAGCCG TTCCAGGTTG CGGAGGTGTT CACCGGCATG ACGGGCCACT 1200 

ACGTGCAGCT GAGCGACACG GTGGAGTCGT TCTCTGGCCT GCTGATGGGG 1250 

TCGTA < 1255 



2) INFORMATION FOR SEQ ID NO: 2190 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1248 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Trypanosoma brucei subsp. Jbrucei 

(B) STRAIN: EATRO 795 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2190 

GCCCTTGACG TTGTTGACAA ACTTGGCCGT GATGAGCCTC TTACTCTTGA 50 

GATCGTGCAG CATCTTGACG CCCACACGGG CCGCTGTATC GCGATGCAAA 100 

CGACGGATCT CCTCAAACTG AAGGCAAAGG TCGTTTCGAC AGGTGGCAAC 150 

ATTTCCGTTC CTGTCGGCCG GGAAACACTA GGTCGTATCT TCAACGTGCT 2 00 

TGGAGACGCT ATTGACCAGC GCGGCCCCGT TGGTGAGAAA CTGCGCATGC 250 

CCATCCATGC CGTGGCTCCC AAGCTTGCGG ACCAGGCCGC TGAGGATGCG 3 00 

GTGCTCACAA CTGGTATTAA GGTGATTGAT CTCATTCTCC CTTACTGCAA 3 50 
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AGGTGGAAAA ATTGGCCTCT TTGGGGGTGC 
TTATTATGGA GCTCATTAAC AACGTTGCCA 
GTCTTCGCTG GTGTTGGTGA GCGTACCCGT 
TGAGATGATG CAGTCTAAGG TTATTGACCT 
TGTTGGTGTA CGGTCAGATG AACGAGCCCC 
GCGCAGTCGG CTCTGACGAT GGCTGAGTAC 
AGATGTGCTT CTTTTTATCG ACAATATTTT 
CCGAGGTGTC GGCGCTTCTG GGTCGTATTC 
CCTACCCTCG CTGAGGATCT AGGGCAGTTG 
AACGAAAGGC TCCATTACTT CTGTGCAGGC 
ACATTAC CGA TCCAGCTCCA GCAACAACCT 
ACTGTGTTGG ACCGTGCTGT TGCCGAGTCT 
CCCACTGGAA TGCGCCTCGC GTATCATGGA 
ATCACTACAA TGTTGCACAA GATGTGGTAC 
GAATTACAGG ATATCATTGC TGTCCTTGGT 
GGACAAACTT ATCGTGGACC GTGCGCGTAA 
AGCCATTCCA AGTTGCTGAG GTCTTCACAG 
CAGTTGGATG ACACCATCGA TTCCTTTTCT 



PCT/CA00/01150 

GGGTGTGGGC AAAACCGTCA 4 00 

AGGGTCACGG TGGTTTCTCT 450 

GAGGGAACGG ATTTGTATCT 500 

TAAGGGTGAG TCCAAATGTG 550 

CAGGTGCCCG TGCGCGTGTT 600 

TTCCGTGATG TGGAGGGCCA 650 

TCGTTTCACT CAGGCTAACT 700 

CCGCCGCTGT TGGCTACCAG 750 

CAGGAGCGTA TTACCTCAAC 800 

CGTATACGTG CCGGCCGATG 850 

TCTCACATCT GGACGCCACA 900 

GGTATCTACC CCGCTGTTAA 950 

CCCCGACGTT ATCAGTGTGG 1000 

AGATGCTCAC CAAGTACAGG 1050 

ATCGACGAGC TAAGCGAGGA 1100 

GTTGGTGAAG TTCCTCTCCC 1150 

GAATGACTGG CCATTACGTG 1200 

GGTCTCCTCA TGGGTACG 1248 



2) INFORMATION FOR SEQ ID NO: 2191 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 440 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) .TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

* . * 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus nidulans 

(B) STRAIN: WSA-176 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2191 

GAGCTGATTG TAAGTCATCG ACCACTTGAT ATATGAATAC ATCTAACAGT 50 

AGTAGAACAA CATCGCCAAG GCTCACGGTG GTTACTCCGT CTTCACTGGT 100 

GTCGGTGAGC GTACTCGTGA GGGTAACGAT CTGTACCACG AAATGCAGGA 150 

GACTGGTGTC ATTCAGCTCG ACGGCGAATC CAAGGTGTCT CTTGTGTTCG 200 

GTCAGATGAA CGAGCCCCCA GGTGCTCGTG CCCGTGTCGC CCTTACTGGT 250 

CTGACCATCG CCGAATACTT CCGTGACGAG GAGGGTCAGG ACGTGCTGCT 3 00 

CTTCATTGAC AACATTTTCC GTTTCACCCA GGCCGGTTCC GAGGTGTCTG 350 

CCCTTCTTGG TCGTATCCCC TCTGCCGTCG GTTACCAGCC CACTCTGGCC 400 

GTCGACATGG GTGGTATGCA GGAACGTATT ACCACCACCA 440 



2) INFORMATION FOR SEQ ID NO: 2192 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1262 bases 
-(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Leishmania 

(B) STRAIN: ATCC 50158 

(xi) SEQUENCE DESCRIPTION: SEQ 

GCTGACGGCG CTGGACGTGA CGGAGGACCT 
CGCTGGAGAT CGTGCAGCAC TTGGATGCGA 
ATGCAGACGA CGGACCTGCT GAAGCTGAAG 
CGGCAACATC TCCGTGCCGG TGGGCCGCGA 
ACGTGCTGGG CGACGCGATT GACCACCGCG 
CGCATGGCGA TCCACGCCGA GGCGCCGAAG 
GGACACGATC CTGACGACCG GCATCAAGGT 
ACTGCAAGGG CGGCAAGATC GGCCTGTTCG 
ACTGTGATCA TCATGGAGCT GATCAACAAC 
CTTCTCCGTG TTCGCCGGCG TTGGCGAGCG 
TGTACCTGGA GATGATGCAG TCAAAGGTGA 
AAGTGCGTGC TTGTGTACGG GCAGATGAAC 
GCGCGTTGCG CAGTCTGCGC TGACGATGGC 
AGGGCCAGAA CGTGCTGCTG TTCATCGACA 
GCGAACTCCG AGGTGTCTGC GCTGCTGGGC 
TTACCAGCCG ACGCTTGCGG AGGATCTTGG 
CGTCGACGAC GAAGGGATCG ATTACGTCTG 
GCGGATGATA TCACGGACCC CGCGCCCGCG 
CGCGACGACT GTGCTGGACC GCGCGGTGGC 
CCGTGAACCC GCTGGAGTGC GCGTCGCGCA 
GACGTGGACC ATTACAACGT TGCACAGGAT 
GTACAAGGAG CTGCAGGACA TCATTGCGGT 
GCGAGGAGGA CAAGGTCGTG GTGGACCGCG 
CTGTCGCAGC CGTTCCAGGT TGCGGAGGTG 
CTACGTGCAG CTGAGCGACA CGGTGGAGTC 
GGTCGTACGA CC 



panamensis 



ID NO: 2192 

CGGCCGCGAT GAGCCGCTGA 50 

ACACCGGCCG CTGCATTGCG 100 

TCGAAGGTTG TGTCGACCGG 150 

GACGCTGGGC CGCATCTTCA 200 

GCCCCGTGTG CGAGAAGATG 250 

CTGGCGGACC AGGCTGCGGA 3 00 

GATCGACCTG ATTCTGCCCT 350 

GCGGTGCCGG TGTGGGCAAG 400 

GTCGCGAAGG GGCACGGCGG 4 50 

CACGCGCGAG GGCACGGACC 500 

TTGACCTGAA GGGCGAGTCG 550 

GAGCCCCCGG GTGCGCGCGC 6 00 

CGAGTACTTC CGCGACGTGG 650 

ACATCTTCCG CTTCACGCAG 700 

CGCATCCCGG CCGCCGTGGG 750 

CATGCTGCAG GAGCGCATTA 800 

TGCAGGCTGT GTACGTGCCT 850 

ACGACGTTCT CGCACCTGGA 900 

GGAGTCGGGC ATCTACCCTG 950 

TCATGGACCC CGATGTGATC 1000 

ATCGTCCAGA TGCTGACCAA 1050 

GCTTGGCATC GACGAGCTGA 1100 

CGCGCAAGGT GACGCGGTTC 1150 

TTCACCGGCA TGACGGGCCA 1200 

GTTCTCTGGC CTGCTGATGG 1250 

1262 



2) INFORMATION FOR SEQ ID NO: 2193 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 912 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aspergillus nidulans 

(B) STRAIN: WSA-176 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2193 

TTCCGATGGT CAAATGTACG ATTGATATTC CTTCCAGCCA GTCAGGATAA 50 

CAGCTGATAC CAGTTGCAAA TAGGCCCCAG ACTCGTGAGC ACTTGTTGCT 100 

TGCCCGTCAG GTTGGTGTCC AGAAGATCGT TGTCTTCGTC AACAAGGTTG 150 

ACGCTGTCGA TGACCCTGAG ATGTTGGAGC TTGTTGAGCT CGAGATGCGT 200 
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GAGCTCCTCA ACACTTACGG TTTCGAGGGA GAGGAGACCC CTATCATCTT 250 

CGGTTCCGCC CTGTGCGCTC TCGAAGGCCG CCGCGAGGAC ATTGGTACTC 300 

AGCGTATTGA CTCCCTCCTC GAGGCCGTTG ACACTTGGAT CCCTACCCCC 350 

CAGCGTGACT TGGACAAGCC CTTCCTGATG TCCATTGAGG AAGTTTTCTC 4 00 

CATTGGTGGT CGTGGTACCG TCGCCTCTGG TCGTGTCGAG CGTGGTCTCC 450 

TCAAGAAGGA TACCGAAGTT GAAATTCACG GTGCTGATGG TATTCTGAAG 500 

ACCAAGGTCA CCGACATTGA GACCTTCAAG AAGAGCTGCG ATGAGTCTCG 550 

TGCTGGTGAC AACTCCGGTC TTCTCCTCCG TGGTATCCGT CGTGAGGATG 600 

TTCGTCGTGG TATGGTCATC GCTGCCCCTG GCTCCATCAA GGCCTCCAAG 650 

AAGTTCATGG TCTCCATGTA CGTCTTGACT GAGGCTGAAG GTGGCCGCAA 700 

GAACGGCTTC GGTGCCAACT ACCGCCCCCA GGCTTTCATC CGCACTGCTG 750 

GTAAGTTTCG AACTATTTGA TTCATTGATC ACGTCCCTAA CTGTTACTTT 800 

AGACGAGGCT TGCGACCTTC ATTTCCCTGA TGAGGCCGAC AAGGACCGCC 850 

ACGTCATGCC CGGTGACAAC GTCGAAATGG TCCTCAACCT CAACAACCCC 900 

GTTGCTGCTG AG 912 



2) INFORMATION FOR SEQ ID NO: 2194 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 887 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

-» 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aureobasidium pullulans 
• (B) STRAIN: WSA-234 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2194 



TCATCGTCGT CGCCGCTTCT GACGGTCAGA 
TTGCTTCTCG CCCGTCAGGT CGGTATCCAG 
CAAGGTCGAC GCCATCGAGG ACAAGGAGAT 
AGATGCGTGA GCTCCTCAGC AGCTACGGCT 
ATCGTCATGG GTTCCGCTCT TTGTGCCCTT 
TGGAACCACC CAGATCGACA ACCTGATGAA 
CCACTCCCCA GAGAGATCTT GAGAAGCCCT 
GTCTTCTCTA 'TCCCCGGTCG TGGTACTGTC 
TGGTACCCTG AAGAAGGATT CCGAAATCGA 
TCCCCATCAA GACCAAGGTC ACCGACATCG 
GACGAGTCCC GCGCTGGTGA CAACTCCGGT 
GCGTGAGGAT GTCAACCGTG GTATGGTTGT 
CCTCGCACAA GAAGTTCCTC GTCTCCATGT 
GGTGGTCGTC ACACTGGTTT CCACGGAAAC 
CCGTACCGCT GGTAAGTCCT GACTTTGAAC 
TCTAACATGT TTTACAGACG AGGCTGCCGC 
CCGAGGACGC TGACTCCAAG ATGGTCATGC 
GTCTGCTCTC TTCACAGACC TCTTGCCGTT 



TGCCCCAGAC CAGAGAGCAC 50 

AAGATTGTCG TCTTCGTCAA 100 

GCTTGAGCTC GTCGAGATGG 150 

TCGAGGGTGA CGAGACCCCC 2 00 

GAGAACCGCC AGCCCGAGAT 250 

CGCTGTCGAC GAGTGGATCC 300 

TCCTCATGTC CGTTGAGGAT 350 

GTTTCTGGCC GTGTTGAGCG 400 

GCTTGTCGGC AAGAACAAGG 450 

AGACCTTCAA GAAGTCTTGC 500 

CTTCTGCTCC GTGGTATCAA 550 

CGTCAAGCCC GGTACCGTCA 600 

ACGTCCTGAC CAAGGAGGAG 650 

TACCGTCCCC AGATCTTCAT 700 

TGCTGACCAA TTTTCGCATC 750 

TATTGACTGG CCCGAAGGCA 800 

CCGGTGACAA CGTCGAGATG 850 

GAACAGG 887 



2) INFORMATION FOR SEQ ID NO: 2195 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 984 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Emmons ia ' pa rva 

(B) STRAIN: ATCC 10784 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2195 

TGACGGCCAA ATGTAGGACT CTTGCGCGGG ACTGACTGAT TGGGGGGAAT 50 

CCATTTTTTC TTTTTTCTTT TTCTTTTTGA GTACATGATT ATACTAATAT 100 

TTGGATAACG TACCAGGCCT CAAACACGTG AACATTTACT CCTTGCCCGA 150 

CAAGTCGGTG TCCAGAAGAT CGTCGTTTTC GTAAACAAGG TCGATGTTCT 200 

CGAAGATAAG GAGATGTTGG AGCTTGTCGA GTTGGAAATG AGAGAGCTCT 250 

TGAACACCTA CGGATTCGAG GGTGAGGAAA CGCCCATCAT CTTCGGTTCT 300 

GCCCTTTGTG CCATGGAGGG CCGCGAGCCT GAGCTGGGCG AGAAGAGAAT 350 

TGATGAATTG CTCGACGCTG TTGATAGCTG GATCCCTACG CCGCAACGTG 400 

ATACGGAAAA ACCCTTCCTA ATGTCCATTG AGGAAGTGTT CTCCATCTCC 450 

GGTCGTGGAA CCGTTGCCTC CGGCCGTGTT GAGCGTGGTG TCCTCAAGAA 500 

GGATTCCGAA GTCGAGCTTG TTGGCGGCGG CGTCGCCCCA ATCAGGACCA 550 

AGGTAACCGA TATCGAAACC TTCAAGAAGT CCTG CGACGA GTCCAGGGCT 600 

GGAGACAACT CTGGCCTTTT GTTGCGTGGT GTCAAGCGTG AGGATATCCG 650 

CCGTGGTATG GTCGTTGTCG TTCCTGGCAG CGTCAAGGCC CACGACAAGT 700 

TCTTGGTGTC CATGTATGTT CTGACCGAAG CCGAGGGTGG TCGCCGAACT 750 

GGATTCGGCC AAAACTATCG TCCTCAAATG TTTATCCGCA CAGCTGGTAC 800 

GTAACGTATA ATGCCTCTCC TCTTCATATA TACCACCCCC CCCACCACTG 850 

ACTCCCTGAC TCTTCGATTA CAGACGAGGC CGCCGATCTC AGCTTCCCTG 900 

ACGCAGCAGA CGAAACCAAA CTGGTTATGC CCGGTGACAA CGTCGAGATG 950 

ATCCTCAAGA CACACCGCCC CATAGCTGCC GAAG 984 



2) INFORMATION FOR SEQ ID NO: 2196 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 806 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Exserohilum rostratum 

(B) STRAIN: WSA-215 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2196 

GCCGCCTCTG ACGG CCAAAT GCCCCAGACC CGTGAGCACT TGCTGCTTGC 50 

TCGCCAGGTC GGTGTTCAGA AGATTGTTGT TTTTGTCAAC AAGGTCGATG 100 

CCGTCGAGGA CAAGGAGATG TTGGAGCTCG TTGAGATGGA GATGCGTGAG 150 

TTGCTCAGCA GCTATGGCTT CGAGGGCGAT GAGACTCCCA TTGTCATGGG 200 
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CTCCGCTCTC TGCGCCATCG AGGGCCGTGA GCCCGAAATT GGTGTCAACC 250 

GTATTGATGA GCTGCTCGAG GCTGTCGACA CCTGGATCCC CACCCCTCAG 3 00 

CGTGATACCG ACAAGCCCTT CCTCATGGCT GTTGAGGACG TCTTCTCCAT 350 

TGCTGGCCGT GGTACCGTCG TTTCTGGCCG TGTCGAGCGA GGTGTCTTGA 400 

AGCGCGATGC CGAAGTCGAG CTGGTTGGCA AGGGCACTGC GCCCATCAAG 450 

ACCAAGGTTA CCGACATTGA AACCTTCAAG AAGTCGTGTG AAGAGTCTCG 500 

CGCCGGTGAC AACTCGGGTC TCCTCCTCCG TGGTGTCAAG CGTGACGACG 550 

TCCGCCGTGG CATGGTCGTT TCTGTTCCTG GACAAGTCAA GGCCCACAAG 600 

AAGTTCCTCG TCTCCATGTA TGTGTTGAGC AAAGAGGAAG GTGGCCGCCA 650 

CACGGGCTTC GGCGAGAACT ACAGGCCACA AATGTTCATC CGTACTGCTG 700 

ACGAGTCATG CGCGCTGCAC TGGCCAGAAG GTACCCCAGA TGCTCACGAC 750 

AAGCTTGTTA TGCCTGGTGA TAACGTTGAG ATGGTTTGTG AGCTTGACGT 800 

GCCACA 806 



2) INFORMATION FOR SEQ ID NO: 2197 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium moniliforme 

(B) STRAIN: WSA-213 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2197 

ACCCGTGAAC ACTTGCTCCT CGCTCGTCAG GTTGGTGTTC AGCGAATTGT 50 

CGTCTTTGTC AACAAGGTCG ATGCCATTGA TGACCCCGAG ATGCTTGAGC 100 

TCGTCGAGAT GGAGATGCGC GAGCTTCTTA ACACCTATGG CTT CGAAGGC 150 

GACGACACTC CCGTCATCAT GGGCTCGGCT CTCATGTCTC TCCAGAACCA 200 

GCGCCCTGAG ATTGGCACCG AGAAGATTGA TGAGCTTCTT GCTGCCGTCG 250 

ACGAGTGGAT CCCAACCCCC GAGCGTGACC TTGACAAGCC CTTCCTTATG 300 

TCCGTCGAGG ATGTCTTCTC CATTGCTGGC CGTGGTACCG TCGTGTCTGG 350 

CCGTGTGGAG CGTGGTGTTC TGAAGCGTGA CCAGGAGATC GAGCTTGTTG 400 

GAAAGGGTCA GGAGGTTATC AAGACCAAGG TTACCGACAT CGAGACCTTC 450 

AAGAAGTCTT GTGAGCAGTC CCAGGCTGGT GACAACTCTG GTCTCCTCAT 500 

CCGAGGTGTT CGCCGTGAGG ATGTCCGCCG TGGTATGGTC GTCTGCGCTC 550 

CTGGCACCGT GAAGTCTCAC ACCCAGTTCC TCGCTTCCCT CTATGTCCTC 600 

TCCAAGGAGG AGGGTGGCCG ACACACCGGT TTCCAGGAGC ACTACCGACC 650 

CCAGCTCTAC CTCCGAACCG CAGATGAGTC CATTGACCTG ACTTTCCCTG 700 

AGGGTACTGA GGATGCCTCC AGCAAGATGG TCATGCCTGG CGACAACACC 750 

GAGATGGTTG TCACCATGGG TCACCCCAAT GCCATCGAGG TTGGTCAGC 799 



2) INFORMATION FOR SEQ ID NO: 2198 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 819 bases 

(B) TYPE: Nucleic acid 
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(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Fusarium solani 

(B) STRAIN: ATCC 32793 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2198 

CTCTGACGGT CAGATGCCCC AGACCCGTGA GCACTTGCTG CTTGCCCGTC 50 

AGGTCGGTGT CCAGAAGATT GTCGTCTTCG TCAACAAGGT CGATGCCATT 100 

GACGACCCTG AGATGCTTGA GCTCGTCGAG ATGGAGATGC GTGAGCTCCT 150 

CAACACCTAC GGCTTCGAGG GTGACGAGAC CCCTGTCATC ATGGGCTCTG 200 

CTCTCATGTC CCTCCAGAAC CAGCGCCCCG AGATCGGTAG CCAGAAGATT 250 

GACGAGCTCC TTGCCGCCGT TGACGAGTGG ATCCCTACCC CCGAGCGTGA 300 

CCTTGACAAG CCCTTCCTCA TGTCCGTTGA GGATGTCTTC TCCATTGCCG 350 

GCCGTGGTAC CGTCGTCTCT GGCCGTGTCG AGCGTGGTGT CCTGAAGCGC 400 

GACCAGGAGA TTGAGCTCGT CGGCAAGGGT AACGAGGTCA TCAAGACCAA 450 

GGTCACCGAC ATTGAGACCT TCAAGAAGTC TTGCGAGCAG TCCCAGGCTG 500 

GTGACAACTC TGGTCTCCTC ATCCGAGGTG TCCGCCGTGA GGATGTCCGC 550 

CGTGGTATGG TCGTCTGCGC CCCCGGCACT GTCAAGTCCC ACACTCAGTT 600 

CCTTTCTTCC CTCTACGTCC TCACCAAGGA GGAGGGTGGC CGACACACTG 650 

GCTTCCAGGA GCACTACCGA CCCCAGCTCT ACCTCCGAAC TGCTGATGAG 700 

TCCATCGACC TGACCTTCCC CGAGGGTACC GAGGACGCCA GCAGCAAGAT 750 

GGTCATGCCC GGTGACAACA CCGAGATGGT CATCACCATG GGCCACCCCA 800 

ACGCCATTGA GGTCGGTCA 819 



2) INFORMATION FOR SEQ ID NO: 2199 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1025 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Histoplasma capsulatum 

(B) STRAIN: WSA-377 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2199 

TGCTGACGGC CAAATGTAAG ACGCCGCGAG GGAGTGCTGA AGGTTTTATG 50 

CTTTTTAGGC CCTTTGTTTC TGAGAGCATG ATGATACTAA TACTCGGAAA 100 

CGTATCTATT AGGCCTCAAA CACGTGAGCA TTTGCTCCTT GCCCGACAGG 150 

TCGGTGTCCA AAAGATCGTC GTTTTCGTGA ACAAAGTCGA CGCCCTTGAG 200 

GACAAGGAGA TGTTGGAGCT TGTCGAGTTA GAAATGAGAG AGCTCTTAAA 250 

CACCTACGGA TTTGAGGGTG AAG AG AC AC C CATCATCTTT GGTTCTGCCC 300 

TTTGCGCCAT GGAAGGCCGT GAGCCTGAGT TGGGAGAAAA GAAAATTGAT 350 

GAATTGCTGG AGGCTGTTGA TACTTGGATC CCAACACCAC AACGTGATAC 400 

CGAAAAACCT TTCTTGATGT CCGTTGAGGA AGTATTCTCT ATCTCCGGTC 450 

GTGGAACCGT TGCCTCCGGT CGTGTTGAGC GCGGTGTCCT CAAGAAGGAT 500 
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TCAGAAGTCG AGCTAATTGG GGGCGGCTCC 
AACTGATATC GAAACTTTCA AGAAATCCTG 
ACAACTCCGG TCTTTTATTG CGTGGTATCA 
GGTATGGTAG TTGCCGTTCC TGGCAGCGTC 
GGTGTCGATG TATGTCCTGA CCGAAGCTGA 
TCGGCCAGAA CTATCGTCCT CAAATGTTCA 
AAATGGGGCC CCTTTTCATA ATCCTTTCTT 
ATCTCTCTCT CTGTTTCTTT TCAACTCGCC 
CCCCTTTGAT TATAGACGAA GCCGCCCATC 
GATGAAAGCA AACTCGTTAT GCCTGGTGAC 
GACACACCGC CCCGTGGCTG CTGAG 
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ACCCCCATCA GGACGAAGGT 550 

TGACGAGTCT AGAGCTGGGG 600 

AGCGTGAAGA TATCCGCCGT 650 

AAGGCCCACG ACAAGTTCTT 700 

GGGTGGTCGC CGAACCGGAT 750 

TCCGCACAGC TGGTATGTCA 800 

TTTTTCCTTT TCCTCTCTCT 850 

TGATTCACGA AATTAACTAA 900 

TCAGCTTCCC TAGTGGAGCA 950 

AACGTCGAGA TGATCCTCCA 1000 

1025 



2) INFORMATION FOR SEQ ID NO: 2200 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 667 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Kocuria kristinae 

(B) STRAIN: ATCC 27570 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2200 



AGCACGTGCT GCTCGCCCGC CAGGTCGGCG 
CTGAACAAGG CCGACATGGT CGAGGACGAG 
GATGGAGGTC CGCGAGCTGC TGTCCTCCCA 
CCCCCGTCAT CCGCGTCTCC GCGCTGAAGG 
TGGGTCAAGT CCATCGAGGA GCTCATGGAG 
GGACCCCGTG CGCGACAAGG ACAAGCCGTT 
TCTTCACCAT CACCGGGCGC GGCACCGTGG 
GGGACCCTGG CCCTGAACTC CGAGGTCGAG 
CCAGAAGACC ACGGTCACCG GGATCGAGAT 
AGGCCTGGGC CGGCGAGAAC TGCGGTCTGC 
GACGACGTCG AGCGCGGCCA GGTCGTGGTG 
GCACACCAAC TTCGAGGCGA ACGTCTACAT 
GGCGTCACAA CCCGTTCTAC TCGAACTACC 
ACCACCGACG TCACCGG 



TGCCGACCCT GCTGGTCGCC 50 

GAGCTGCTGG ACCTCGTCGA 100 

GGAGTTCGAC GGCGACAACG 150 

CGCTGGAGGG CGACGAGAAG 200 

GCCGTGGACG AGTACATCCC 250 

CCTGATGCCC ATCGAGGACG 300 

TGACCGGTCG CGCCGAGCGC 350 

ATCGTCGGCA TCCGCCCGAT 400 

GTTCCACAAG CAGCTCGACG 450 

TGCTGCGCGG CCTGAAGCGC 500 

AAGCCGGGTT CCATCACCCC 550 

CCTGTCCAAG GACGAGGGTG 600 

GTCCGCAGTT CTACTTCCGG 650 

667 



2) INFORMATION FOR SEQ ID NO: 2201 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 778 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Vibrio mimicus 

(B) STRAIN: ATCC 33653 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2201 

TGCTGCAACA GATGGTCCAA TGCCACAAAC TCGTGAGCAC ATCCTGCTGG 50 

GTCGCCAAGT AGGTATTCCT TACATCATCG TATTCATGAA CAAATGTGAC 100 

ATGGTTGACG ATGAAGAGCT TCTAGAGCTG GTTGAGATGG AAGTTCGTGA 150 

GCTTCTGTCT GAGTACGATT TCCCAGGTGA TGACCTGCCA GTAATCCAAG 200 

GTTCAGCACT AGGCGCGCTA AACGGCGAAG CACAGTGGGA AGCGAAGATT 250 

GTTGAACTAG CAGAAGCACT AGATTCATAC ATTCCAGAGC CAGAGCGTGC 300 

AGTAGACATG GCATTCCTGA TGCCAATCGA AGACGTATTC TCAATCCAAG 350 

GTCGTGGTAC AGTAGTAACT GGCCGTATCG AGCGCGGCAT CCTGAAAGTG 400 

GGTGACGAAG TTGCGATCGT TGGTATCAAA GACACAGTAA AAACTACCTG 450 

TACAGGTGTA GAAATGTTCC GTAAGCTGCT TGACGAAGGT CGTGCAGGTG 500 

AGAACGTTGG TGCACTGCTA CGTGGTACTA AGCGTGAAGA AGTAGAGCGT 550 

GGTCAAGTAC TGGCGAAGCC AGGTTCAATC ACCCCACACA CTAAGTTCGA 600 

ATCAGAAGTA TACGTACTGT CAAAAGACGA AGGTGGCCGT CATACTCCAT 650 

TCTTCAAAGG TTACCGTCCA CAGTTCTACT TCCGTACAAC TGACGTAACA 700 

GGCAGCATCG AGCTTCCAGA AGGCGTAGAA ATGGTAATGC CAGGCGACAA 750 

CATCAAGATG GTTGTAGACC TGATTGCA 778 



2) INFORMATION FOR SEQ ID NO: 2202 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Citrobacter freundii 

(B) STRAIN: ATCC 8090 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2202 

AACGCTGACC CTGCAGGTTA TTGCCGCTGC GCAGCGTGAA GGTAAAACCT 50 

GTGCATTTAT CGATGCAGAA CACGCACTGG ACCCGGTCTA TGCCCGTAAG 100 

CTTGGCGTTG ATATCGATAA CCTGCTGTGT TCTCAGCCGG ATACCGGTGA 150 

ACAAGCGCTG GAAATCTGTG ATGCACTGGC GCGCTCCGGT GCGGTTGACG 200 

TTATCGTTGT CGACTCCGTT GCCGCATTGA CGCCGAAGGC AGAAATCGAA 250 

GGCGAGATTG GCGACTCTCA CATGGGCCTT GCGGCGCGTA TGATGAGCCA 300 

GGCGATGCGT AAGCTGGCCG GTAACCTGAA GCAGTCCAAC ACGCTGCTGA 350 

TTTTCATCAA CCAGATCCGT ATGAAGATTG GCGTTATGTT CGGTAACCCG 400 

GAAACCACCA CC 412 



2) INFORMATION FOR SEQ ID NO: 2203 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 337 bases 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium botulinum 

(B) STRAIN: 20:1.2 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2203 

TTTAGATCCA TCTTATGCTA GAAATTTAGG TGTTGATATA GATAACCTAA 50 

TAGTTTCTCA ACCAGATACA GGAGAACAGG CTTTAGAGAT AACAGAAGCT 100 

TTAGTAAGAT CAGGAGCAGT AGATGTTATA GTTGTAGACT CTGTAG CAGC 150 

TTTAGTTCCT AGGGCAGAAA TAGAAGGAGA AATGGGAGAC TCACATGTAG 200 

GTCTTCAAGC AAGACTTATG TCTCAAGCCC TAAGAAAATT AGCAGGATCT 250 

ATAAATAAAT CTAAGTGTGT AGCTATATTT AT AAAC CAAT TAAGAGAAAA 300 

GGTTGGTATA ATGTTTGGAA ATCCAGAAAC AACTCCT 337 



2) INFORMATION FOR SEQ ID NO: 2204 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS:. Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Francisella 

(B) STRAIN: ATCC 29684 

(xi) SEQUENCE DESCRIPTION: SEQ 

AAAGCAAGGC GGTACTGCAG CATTTGTTGA 
CAAAATATGC AAAGCTTTTA GGTGTTGATG 
CAGCCGGATA CGGGTGAGCA AGCTTTAGAG 
TTCTGGAGGA GTTGATATTG TAGTAATTGA 
CAAAGGCAGA GATTGAGGGT GACATGGGCG 
GCAAGATTAA TGTCACAAGC ACTAAGAAAA 
CTCAAATACT CTAGTGATAT TCATTAACCA 
TTATGTTTGG TAACCCTGAA ACTACAACT 



tularensis 



ID NO: 2204 

TGCTGAGCAT GCACTAGATC 50 

TTGATAATCT GATCGTGTCA 100 

ATTGCTGATA TGTTGGTACG 150 

CTCTGTTGCT GCACTTACGC 200 

ACTCGCACAT GGGCTTACAA 250 

CTAACGGCAA ATATCAAGCG 3 00 

AATTCGTATG AAGATCGGGG 350 

379 



2) INFORMATION FOR SEQ ID NO: 2205 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus anaerobius 

(B) STRAIN: ATCC 27337 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2205 



ACTTGACCCA GTATATGCAA GGGCTCTTGG AGTGGATATA GACAACCTAG - 50 

TCATATCTCA GCCAGATACA GGAGAACAGG CCCTAGATAT AGCAGAGTCC - 100 

CTTATAAGAT CAGGAGCTGT AGATATACTA GTAATAGACT CAGTAGCTGC - 150 

CCTAGTACCT AAGGCAGAAA TAGAAGGTGA CATGGGAGAT TCTCACGTAG - 200 

GTCTACAGGC TAGACTTATG TCACAGGCAC TTAGAAAATT GACTGGATCT - 250 

ATAAAGAAGT CAAACTGTGT TGTTATATTT ATCAACCAGT TGAGAGAAAA- 3 00 

AGTAGGGGTT ATGTTCGGTA ATCCAGAGAC AACAACA 337 



2) INFORMATION FOR SEQ ID NO: 2206 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Peptostreptococcus asaccharolyticus 

(B) STRAIN: LSPQ 2639 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2206 



TCTTGAT.GCT GGATATGCAA AAAACCTTGG AGTAGATGTA GAAAATTTAA - 50 

TTATTTCTCA ACCTGATACA GGTGAGCAAG CCTTAGAAAT AACTGAAGCT - 100 

CTTGTAAGAT CTAACGCTGT TGATTTAATT ATTATAGACT CAGTTGCCGC - 150 

ACTTGTACCA AAAGCAGAAA TCGATGGTGA CATGGGAGCT GCACAAATAG - 200 

GTCTTCAAGC AAGACTTATG TCTCAAGCTC TTAGAAAATT AACTGGGGCA- 250 

ATCAACAAGT CAAAATGTAC CGTTGTATTT ATTAACCAAC TTAGAGAAAA- 3 00 

AGTTGGTATC ATGTTTGGTA ACCCAGAAAC TACAACA 337 



2) INFORMATION FOR SEQ ID NO: 2207 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 408 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Providencia stuartii 
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(B) STRAIN: ATCC 33672 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2207 



CTCACGTTGC AAGTTATTGC AGCAGCACAA CGTAGCGGAA AAACCTGTGC 
ATTTATCGAC GCTGAACATG CGCTAGATCC AATCTATGCG AAAAAACTGG 
GTGTTGATAT CGATAACCTT CTATGTTCTC AACCTGATAC TGGTGAGCAA 
GCATTAGAGA TTTGTGATGC ACTGACGCGT TCAGGCGCTG TTGATGTCAT 
TATCGTTGAC TCCGTGGCCG CATTAACACC AAAAGCTGAA ATTGAAGGTG 
AAATCGGTGA CTCACACATG GGCTTAGCGG CTCGTATGAT GAGCCAAGCG 
ATGCGTAAAT TAGCGGGTAA CTTAAAGAAC TCGAATACAC TTTTAATCTT 
CATTAACCAA ATCCGTATGA AGATTGGCGT TATGTTTGGT AACCCAGAAA 
CCACTACA 



50 
100 
150 
200 
250 
300 
350 
400 
408 



2) INFORMATION FOR SEQ ID NO: 2208 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 403 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Paratyphi A 

(B) STRAIN: ATCC 9150 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2208 

GCTGCAGGTG ATTGCCGCTG CGCAGCGTGA AGGTAAAACC TGTGCGTTTA 50 

TCGATGCGGA ACACGCGCTT GACCCTGTTT ACGCACGCAA GCTGGGCGTC 100 

GATATCGATA ACCTGCTTTG TTCTCAGCCG GATACCGGCG AGCAGGCGCT 150 

GGAAATCTGT GACGCGCTGG CGCGTTCAGG CGCGGTGGAC GTCATTGTGG 200 

TCGACTCCGT AGCGGCGCTA ACGCCGAAAG CGGAAATCGA AGGCGAAATT 250 

GGCGACTCTC ACATGGGCCT CGCGGCGCGT ATGATGAGCC AGGCGATGCG 300 

TAAGCTGGCG GGGAACCTAA AACAGTC CAA CACGCTGTTG ATTTTCATCA 350 

ACCAGATCCG TATGAAGATT GGCGTGATGT TCGGTAACCC GGAAACCACC 400 

ACC 403 



2) INFORMATION FOR SEQ ID NO: 2209 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: Salmonella choleraesuis subsp. choleraesuis 
serotype Typhimurium 

(B) STRAIN: ATCC 14028 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2209 



GACCCTGACG CTGCAGGTGA TTGCCGCTGC GCAGCGTGAA GGTAAAACCT 50 

GTGCGTTTAT CGATGCGGAA CACGCGCTTG ACCCTGTTTA CGCACGCAAG 100 

CTGGGCGTCG ATATCGATAA CCTGCTCTGC TCTCAGCCGG ATACCGGCGA 150 

GCAGGCGCTG GAAATCTGTG ACGCGCTGGC GCGTTCAGGC GCGGTGGACG 200 

TCATTGTGGT CGACTCCGTA GCGGCGCTAA CGCCGAAAGC GGAAATCGAA 250 

GGCGAAATCG GCGACTCTCA CATGGGCCTC GCGGCGCGTA TGATGAGCCA 300 

GGCGATGCGT AAGCTGGCGG GGAACCTGAA ACAGTCCAAC ACGCTGTTGA 350 

TTTTCATCAA CCAGATCCGT ATGAAGATTG GCGTGATGTT CGGTAACCCG 400 

GAAACCACCA CC 412 



2) INFORMATION FOR SEQ ID NO: 2210 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 337 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus saprophyticus 

(B) STRAIN: ATCC 15305 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2210 



ATTAGATCCT GTTTACGCAG AAGCTTTAGG CGTAGATATA CAAAATTTAT- 50 

ATTTATCTCA ACCTGATCAT GGGGAACAAG GTTTAGAAAT TGCCGAAGCA- 100 

TTTGTTAGAA GTGGCGCTGT TGATATCGTT GTGGTCGATT CAGTTGCTGC - 150 

GCTTACACCT AAAGCTGAAA TTGAAGGTGA AATGGGAGAT ACGCACGTTG - 200 

GTTTGCAAGC ACGTCTTATG TCCCAAGCCT TGAGAAAGCT TTCCGGTGCA - 250 

ATTTCAAAAT CAAATACAAC AGCAGTATTT ATCAACCAAA TCCGTGAAAA- 3 00 

AGTTGGTGTG ATGTTCGGTA ATCCTGAAGT TACACCA 337 



2) INFORMATION FOR SEQ ID NO: 2211 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 412 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pseudotuberculosis 

(B) STRAIN: ATCC 29833 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2211 



GACACTGACA TTACAGGTTA 
GTGCATTTAT CGATGCCGAA 
TTGGGTGTAG ATATTGATAA 
GCAGGCACTG GAAATTTGTG 
TTATCATCGT TGACTCCGTA 
GGTGAAATTG GCGATTCTCA 
GGCTATGCGT AAGCTGGCGG 
TTTTTATCAA CCAAATCCGC 
GAAACCACTA CC 



TCGCCGCCGC ACAGCGTGAA GGCAAAACGT 
CATGCCCTTG ACCCAATCTA TGCCAAGAAA 
CCTACTGTGT TCTCAGCCAG ATACTGGCGA 
ATGCGCTGAC TCGCTCTGGT GCGGTTGACG 
GCGGCATTGA CACCAAAAGC TGAAATTGAA 
TATGGGCCTT GCCGCGCGTA TGATGAGCCA 
GTAACCTGAA GAATGCGAAT ACCTTACTGA 
ATGAAAATTG GCGTGATGTT TGGTAACCCA 



50 
100 
150 
200 
250 
300 
350 
400 
412 



2) INFORMATION FOR SEQ ID NO: 2212 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 404 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zoogloea ramigera 

(B) STRAIN: ATCC 25935 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2212 

CACGCTCACC TTGCAAACCA TCG CGGAAAT GCAAAAACTG GGCGGCACCT 50 

GCGCGTTTAT CGACGCCGAG CACGCACTGG ACGTCACGTA CGCGCAAAAG 100 

CTGGGCGTCA ACCTGAGCGA TCTGCTGATC TCGCAACCGG ACACCGGCGA 150 

ACAAGCGCTG GAAATCTGCG ACGCCCTGGT GCGTTCCGGT TCGGTGGACA 200 

TGGTCGTGAT CGACTCGGTC GCCGCGCTGA CCCCGCGCGC CGAGATCGAA 250 

GGCGACATGG GCGATTCGCT GCCAGGTTTG CAGGCACGTT TGATGTCGCA 3 00 

AGCACTGCGC AAGCTTACCG GTTCGATCAA CCGCACCAAC ACCCTGGTCA 350 

TCTTCATCAA CCAGATCCGC ATGAAAATCG GCGTCATGTT CGGCAGCCCG 400 

GAAA 404 



2) INFORMATION FOR SEQ ID NO: 2213 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 42 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2213 
CGTGCCATTG ACATGATTTC CGAAGAAGAC GCTGAAGGCA CG 
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2) INFORMATION FOR SEQ ID NO: 2214 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 125 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY : Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia adiacens 

(B) STRAIN: ATCC 4 9175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2214 

CAACTACATT ACGTTCTGCA ACACAAGGTC GTGGTACTTT CAGTATGACA 50 
TTTGACCACT ATGAAGATGT TCCTAAGAGC ATTGCAGAAG AAATCATCAA 100 
GAAAAATGGC GGTAACGGAG AATAA 125 



2) INFORMATION FOR SEQ ID NO: 2215 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 140 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: AcinetoJbacter baumannii 

(B) STRAIN: ATCC 19606 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2215 

CGACTCAAAT GCGTTCTATG TCTCAAGGTC GTGCGACATA CTCAATGGAA 50 
TTTGCTAAAT ATGCTGAAAC TCCACGTAAC GTGGCTGAAG GCATCATCGC 100 
TAAATTCCAA GCTGGCGGTA AAAAAGGTGA CGACGAGTAA 140 



2) INFORMATION FOR SEQ ID NO: 2216 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Actinomyces meyeri 

(B) STRAIN: ATCC 35568 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2216 



CCGGTGACCT GCGTTCTAAG ACGCAGGGTC GCGCTGTCTA CTCCATGGAG 
TTCGACAGCT ACGCCGAGGT TCCGCGCGCG GTCGCGGATG AGATCGTCGG 
CAAGTCTCGG GGCAACTGA 



50 
100 
119 



2) INFORMATION FOR SEQ ID NO: 2217 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2217 

CGACCGACGT TCGCTCCATG TCCCAGGGTC GCGCAAGCTA CTCTATGGAA 50 
TTCAAAAAAT ACAACACAGC TCCGGCGCAC ATCGCTGAAA CTGTATCCAA 100 
AAAACAAGGC TGA 113 



2) INFORMATION FOR SEQ ID NO: 2218 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium diphtheriae 

(B) STRAIN: ATCC 27010 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2218 

GCGACCTGCG TTCCCGTACC CAGGGCCGTG CAAACTACAC CATGATCTTC 50 
GACTCCTACG CTGAGGTTCC TACCAACGTG GCAGCTGAGA TCGTGGCAGA 100 
GCGCAACGGC ACTGCCTAA 119 



2) INFORMATION FOR SEQ ID NO: 2219 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: En terobacter cloacae 

(B) STRAIN: ATCC 13047 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2219 

AACTCAGCTG CGTTCTCTGA CCAAAGGTCG TGCATCATAC ACCATGGAAT 50 
TCCTGAAGTA TGATGATGCG CCTAACAACG TTGCTCAGGC CGTTATTGAA 100 
GCCCGTGGTA AGTAA 115 



2) INFORMATION FOR SEQ ID NO: 2220 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae pneumoniae 

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2220 

AACTCAGCTG CGTTCTCTGA CCAAAGGTCG TGCATCATAC ACCATGGAAT 50 
TCCTGAAGTA TGATGATGCG CCGAACAACG TTGCTCAGGC CGTTATTGAA 100 
GCCCGTGGTA AATAA 115 



2) INFORMATION FOR SEQ ID NO: 2221 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 113 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

(B) STRAIN: ATCC 15313 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2221 



CAACTCACCT TCGTTCAGGT ACGCAAGGTC GTGGTGTATA CACTATGCAA 50 
TTTGACCACT ATGAAGAAGT TCCTAAATCT ATTGCTGAAG AAATCATTAA 100 
AGCTAATGGT GGA 113 



2) INFORMATION FOR SEQ ID NO: 2222 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium avium 

(B) STRAIN: ATCC 25291 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2222 



CGGCGACCTG CGGTCCAAGA CCCAAGGCCG GGCGAACTAC TCCATGGTCT 50 

TCGACTCCTA CGCCGAAGTG CCGGCCAACG TGTCGAAGGA GATCATCGCG 100 

AAGGCGACGG GTCAGTGA 118 

2) INFORMATION FOR SEQ ID NO: 2223 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium gordonae 

(B) STRAIN: Mgor-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2223 



CCGGCGACCT GCGGTCCAAG ACGCAAGGCC GGGCGAACTA CTCCATGGTG 50 
TTCGACTCGT ACGCCGAAGT TCCGGCGAAC GTGTCGAAGG AGATCATCGC 100 
GAAGGCGACG GGCGAATAG 119 



2) INFORMATION FOR SEQ ID NO: 2224 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium kansasii 

(B) STRAIN: Mkan-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2224 

CGGCGACCTG CGGTCCAAGA CTCAAGGCCG GGCGAACTAC TCGATGGTGT 50 
TCGATTCCTA CGCCGAAGTG CCGGCTCAGG TGTCGAAGGA GATCATCGCG 100 
AAGGCGACTG GCGAGTGA 118 



2) INFORMATION FOR SEQ ID NO: 2225 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM:. Mycobacterium terrae 

(B) STRAIN: Mter-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2225' 

CGGAGACTTG CGGTCGAAGA CCCAGGGCCG GGCGAACTAC TCCATGGTGT 50 
TCGACTCCTA CGCCGAAGTG CCGGCGCAGG TGGCGAAGGA GATTATCGCG 100 
AAGGCAACGG GCGAGTAA 118 



2) INFORMATION FOR SEQ ID NO: 2226 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 bases 

(B) TYPE: Nucleic acid 

( C ) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 43768 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2226 

GACCGACCTG CGTTCTGCAA CCCAAGGCCG CGCTACTTAC TCTATGGAGT 50 

TCAAGAAATA TTCTGAAGCT CCTGCCCACA TAGCTGCTGC TGTAACTGAA 100 

GCCCGTAAAG GCTAA 115 
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2) INFORMATION FOR SEQ ID NO: 2227 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 14990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2227 

AACTTCATTA CGTTCTAACA CGCAAGGTCG CGGTACTTAC ACAATGTACT 50 
TTGACCACTA TGCAGAAGTT CCTAAATCAA TTGCTGAAGA AATCATCAAG 100 
AAAAATAAAG GTGAATAA 118 



2) INFORMATION FOR SEQ ID NO: 2228 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 118 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2228 

AACTTCATTA CGTTCTAACA CTCAAGGTCG CGGTACTTAC ACTATGTACT 50 
TCGATCACTA TGCAGAAGTT CCAAAATCAA TTGCTGATGA TATCATCAAA 100 
AAAAATAAAG GTGAATAA 118 



2) INFORMATION FOR SEQ ID NO: 2229 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1630 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Succinivibrio dextrinosolvens 

(B) STRAIN: ATCC 19716 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2229 

CGTTCAATGC GTGTTCTAGA CGGTGCAGTT ATGGTTTACT GTGCTGTGGG 50 

TGGTGTTCAG CCTCAGTCTG AAACCGTTTG GAGACAGGCT CAGAAGTACA 100 

AGGTTCCTCG TATTGCTTTC GTTAATAAGA TGGACCGTAC TGGTGCTAAT 150 

TTCCTACGTG TTGTAGAGCA GATTAAGACC CGTCTAAAGG GTAACCCTGT 2 00 

TCCTCTAATG TTACCTATCG GTAAAGAGGA CAGCTTTGTT GGTGTAGTTG 250 

ACCTAATCAA GCGTAAGGCT ATCGACTGGG ATGAGGCATC TCAGGGTATG 3 00 

AAGTTTGAGT ACGTTGACAT TCCAGCAGAT ATGGTTGAGG AAGTTGAAGA 3 50 

GTGGCGTGCA AAGCTTGTAG AAGCAGCTGC AGAAGCTAAC GACGAGCTGA 400 

TGGATAAATT CTTCGGTGGT GAAGAGCTGA CCGAGGAAGA GATCAAGGCT 450 

GCTCTACGTG AGCGTACTCT TCGCAACGAA ATTATTCCTA TGTGCTGCGG 500 

TTCAGCATTT AAGAACAAGG GTGTTCAGGC AATGCTTGAC GCTGTTGTTG 550 

AGTATCTTCC ATCTCCAGCA GATGTTCCTG CTGTTGAGGG TAAGACCCTA 600 

ACCGGTGAAG CTGATACTCG TAAGGCTGAC GATAAAGAGC CATTCTCTGC 650 

TTTAGCATTT AAGCTAGCAA ATGACCCATT CGTAGGTAAC TTAACATTCT 700 

TACGTTGCTA CTCAGGCTTT ATTAAGTCTG GTGACACTGT AATGAACTCA 750 

GATAAGCAGA AGCGTGAGCG TTTCGGCCGT CTAGTTCAGA TGCACGCTAA 800 

TGCTCGTAAT GAGGTTAGCG AGGTTTATGC AGGTGACATC GTTGCTGCTA 850 

TTGGTCTGAA GGAAACCGTT ACCGGTGATA CCTTATGTGA CCCAGAGCAT 900 

CCAATCATTC TTGAGTCAAT CGACTTTGCA GAGCCAGTTA TCTCTGTAGC 950 

AGTTGAGCCT AAGACCAAGG ACGATCAGGA GAAGATGGCT CTTGCTTTAC 1000 

AGCGTTTAGC AAAAGAAGAT CCTTCATTCC GCGTTCGTAC AGACGAAGAG 1050 

TCTGGCCAGA CCATTATTTC TGGTATGGGT GAGCTTCACC TAGACATCAT 1100 

TGTTGACCGT CTACGCCGTG AGTTCAAGGT TGAGTGTAAT CAGGGTAAGC 1150 

CACAGGTTGC ATACCGTGAG ACCATTAAGA GCAAGGTTGA ACAGCAAGGT 12 00 

AAGTTTGCTC GTCAGTCTGG TGGTCGTGGT CAGTACGGTG ACTGCTGGTT 1250 

ACGTATGGAA CCTCTTGAGC CAGGTAAGGG CTACGAATTC GTGAATGAGA 13 00 

TTGTTGGTGG TGTAATTCCT AAGGAATATA TCCCTGCAAT TGATAAGGGC 1350 

TGTCAGGAGC AGATCGCTAA CGGTGTTCTA GCTGGTTTCC CAGTTGTTGA 1400 

CAT CAAGATC ACTGTATTCG ATGGTTCTTA CCACGAAGTT GACTCTTCAG 1450 

AAATGGCATT CAAGATTGCT GCTTCTATGG CATTCAAAGA GGGCTTCAAG 1500 

AAGGCAAATC CTGTTCTTCT AGAGCCTTTA ATGAAGGTAG AAGTTGATAC 1550 

TCCTGAAGAC TACATGGGTG ACGTTATTGG TGACTTAAAC CGTCGTCGTG 1600 

CTATCGTTGA AGGCATGGAA GATGGTCCTA 1630 



2) INFORMATION FOR SEQ ID NO: 2230 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1662 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Tetragenococcus halophilus 

(B) STRAIN: ATCC 33315 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2230 
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TTGAAGTGCA ACGTTCATTG CGTGTGCTAG 
GACTCACAAT CAGGTGTTGA ACCACAAACT 
AACAGAATAT CAAGTACCTC GTATTGTATT 
TTGGCGCAGA TTTCTTATAT TCTGTCCGAT 
GCTAATGCAC AACCTATCCA ATTGCCAATT 
AGGTATCATC GACCTTGTGA AGATGAAAGC 
TAGGGACTAC TTTCGAAGAA ACTGAAATCC 
GCTCAAGAAT GGCATAATAA TTTGGTAGAA 
AGATATCATG ATGAAATACT TGGAAGGTGA 
TACAAGCAGG TATTCGTAAA GCAACATTAT 
TTATGTGGTT CTGCATTTAA AAACAAAGGT 
AGTAATTGAT TACTTGCCTT CTCCAACCGA 
TCGATCCGAA AACAGATGAA GAAACTGAAC 
CCTTTTTCAT CACTTGCTTT TAAAGTTATG 
CTTAACTTTC TTCCGTGTTT ATTCAGGTGT 
TATTGAATGC TACTAAGGGT TCACGTGAAC 
ATGCATGCCA ATTCTCGTTC TGAGATCGAT 
TGCAGCTGCT GTAGGCTTGA AGAACACTAC 
ATGAGAAAAA TCCAGTTATT TTGGAAACTA 
ATTCAAGTTG CTGTTGAACC TAAGTCAAAA 
CGTAGCACTA CAAAAACTTG CAGAAGAAGA 
CCAACGCTGA AACTGGCGAA ACTGTAATTG 
TTAGACGTTC TTATTGACCG TATGAAGACT 
TATTGGTGCA CCACAAGTTT CTTATCGTGA 
AAGCTGAAGG GAAATTTATC CGCCAATCTG 
CACGTATGGG TTGAATTTAC TCCAAACGAA 
TAAAAACTCC ATTGTTGGTG GGGTTGTCCC 
TACAAAAAGG ACTGGAAGAC GCCATGGAAA 
CCATTAGTTG ACGTAAAGGC AGAACTGTTT 
CGACTCTAAT GAAACAGCCT TCCGTATTGC 
AAGCTGCGAA AAAGGCAGAT CCAGTTATTC 
ACAATTAGTA TCCCTGAAGA ATATCTAGGT 
AGCTCGTCGT GGTCGTGTTG AAGGAATGGA 
CTGTAAATGC GT 



ACGGTTCAGT TACTGTCTTG 50 

GAAACAGTTT GGCGTCAAGC 100 

CTGTAATAAA ATGGATAAGG 150 

CATTGCATGA TCGTTTAGAA 200 

GGTGCTGAAG ATAACTTTGA 250 

TGAATTTTAT AAAGATGATT 3 00 

CAGATGAATA TAAAGAAACA 350 

TCTGTAGCTG ATTTTGATGA 400 

AGAAATTACA CCTGAAGAGT 450 

CTGTTGAATT TTACCCAGTA 500 

GTTCAAATGA TGTTGGATGC 550 

CGTTCCCCCA ATTAAAGGGA 600 

ATCCTGCTGA " TGATAGTGAG 650 

TCAGACCCTT ATGTTGGCCG 700 

GTTGGATACA GGTTCTTATG 750 

GAATTGGTCG TATTTTGCAA 800 

AAGGTTTATT CAGGTGACAT 850 

AACAGGGGAT AC C CTTTGTG 900 

TCAACTTCCC TGAACCAGTA 950 

GCTGACCAAG ATAAAATGAG 1000 

CCCATCTTTT AAAGTGGAAA 1050 

CTGGTATGGG TGAACTTCAA 1100 

GAATTTAAAG TGGATGCCAA 1150 

AACTTTCCGT TCATCAACTA 1200 

GTGGTAGAGG TCAATACGGT 1250 

GAAGGAGCAG GATTCGAATT 1300 

TCGTGACTAT ATACCTGCAG 13 50 

ATGGTGTGTT AGCTGGTTAT 1400 

GATGGTTCTT ACCATGACGT 1450 

GGCTTCAATG TCTCTACGTG 1500 

TTGAACCGAT GATGAAAGTA 1550 

GATATTATGG GACATGTTAC 1600 

TGCTCACGGT AATGCACAAA 1650 

1662 



2) INFORMATION FOR SEQ ID NO: 2231 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1652 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Veillonella parvula 

(B) STRAIN: ATCC 10790 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2231 

AAGTAGAACG TTCTCTACGT GTACTTGACG GTTCTGTTGC GGTGTTCAGT 50 
GCTAAAGGTG GCGTTGAACC TCAATCCGAA ACAGTATGGC GTCAGGCTTC 100 
TAACTACGGC GTACCTCGTA TCGCTTATGT AAATAAGATG GATACTGTAG 150 
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GTGCTGACTT CTTCAACGTA GTTGACATGA TGAAAGCTCG TTTGGGTGCA 2 00 

AATTCCGTAG CTATCCAAGT ACCAATCGGT GCTGAAGATA CTTTCGAAGG 250 

CATCATTGAC TTGATGACTA TGAAAGCGGA AATTTATAAA TCCGATGACG 3 00 

GTAAAGAATA TGAAATCACT GATATCCCTG CTGAATATCA AGAAGTAGCA 350 

GAAGCTCGTC GCGAAATGAT GATCGATGCT ATCGCTGAAA CAGATGATGA 400 

TATCATGATG AAATATTTGG AAGGCGAAGA AATTTCTGTG GAAGAATTGA 450 

AAGCGGCATT GCGTAAAGCT GTTATTGCTA ACCAATTATT CCCAGTTCTT 500 

TGTGGTTCTT CCTATAAAAA TAAAGGTGTT CAAATGTTAT TGGATGCTGT 550 

TATCGATTAC ATGCCAGCTC CAATCGACAT CCCACCTATT AAAGGTGTTG 600 

TTCCTGGTAC TGAAGAAGAA ACAACTCGTC CTTCTTCCGA TGAAGAGCCA 650 

TTCTCTGCAT TGGCATTCAA AATCATGGCT GACCCTTATG TTGGTAAATT 700 

AGCGTTCTTC CGTGTGTACT CCGGTACATT GGAATCTGGC TCCTACGTTT 750 

TCAACTCCAC TAAAGGTAAA AAAGAACGTA TCGGTCGTAT TCTTCAAATG 800 

CACGCTAACT CCCGTAAAGA AATCGAACGC GTATATTCTG GTGACATCGC 850 

TGCGGCGGTT GGCTTAAAGG ATACTACTAC AGGCGACACA TTGTGTGATG 900 

AAAAATCTCC TGTAATCCTT GAGTCCATGG AATTCCCTGA ACCAGTTATC 950 

TCCGTTGCTG TTGAACCTAA AACAAAAGCT GACCAAGAAA AAATGGGTAC 10 00 

AGCTCTTGCT CGTTTGGCAG AAGAAGATCC TACTTTCAAA GTTCGTACTG 1050 

ATGAAGAAAC AGGTCAAACT ATTATCTCTG GTATGGGCGA ACTTCACTTG 1100 

GATATCATCG TTGACCGTAT GAACCGTGAA TTCAAAGTAG ATTGTAACGT 1150 

AGGTAAACCT CAAGTAGCAT ACCGCGAAAC TATCCGTAAA GCTGTTAAGG 1200 

CTGAAGGTAA ATTCGTACGT CAATCTGGTG GTCGTGGTCA ATATGGTCAC 1250 

TGCTGGTTGG AATTGATTCC TCAAGAACCA GGTGCTGGCT TCGAGTTTGA 1300 

AAACAAGGTT GTAGGTGGTG CGATTCCTCG TGAATACATC GGACCTGTTG 1350 

AAAGCGGTGT TAAAGAAGCT ATGGAATCCG GTGTTATCGC TGGGTACCCT 14 00 

ATGGTTGATG TTAAAGTTAT CGTATTTGAT GGTTCTTACC ATGACGTTGA 1450 

CTCCAACGAA ATGGCCTTCA AAATTGCTGG TTCTATGGGC TTCAAAGAAG 1500 

GTGGTCGCAA AGCAGACCCT GCATTGCTTG AACGATATAT GGCTGTAGAA 1550 

GTAGACGTTC CTGAAGAATA CATGGGCGAC GTTATCGGTG ACTTGAACTC 1600 

TCGTCGTGGT CGCATGGACG GCATGGAAGC TCGTAATGGT TCCCAACATA 1650 

TC 1652 



2) INFORMATION FOR SEQ ID NO: 2232 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1624 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Yersinia pseudotuberculosis 

(B) STRAIN: ATCC 29833 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2232 

AGTAGAGCGT TCCATGCGTG TTCTTGACGG CGCGGTAATG GTTTACTGTG 50 

CAGTTGGTGG TGTTCAGCCA CAGTCTGAAA CCGTATGGCG TCAGGCTAAT 100 

AAATATAAAG TTCCACGTAT TGCGTTCGTT AACAAAATGG ACCGTATGGG 150 

TGCGAACTTC CTGCGCGTAG TTGGTCAACT GAAATCTCGC CTTGGTGCGA 200 

ACCCAGTTCC ACTGCAGTTG GCAATTGGCG CAGAAGAAAA ATTCACCGGT 250 

ATTATCGATC TGGTGAAAAT GAAAGCGATC AACTGGAACG AAGCTGATCA 3 00 

GGGCGTGACC TTCGAATATG AAGAAATCCC TGCTGATATG GCTGAACTGG 3 50 
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CTGCTGAATG GCACCAGAAT CTGGTTGAAT CTGCGGCAGA AGCGTCTGAC 400 

GAGCTGATGG ACAAATACTT GGGTGGCGAA GAGCTGACCG AAGAAGAAAT 450 

CAAGAAAGCT TTACGT CAAC GTGTTCTGAA AAGCGAAATT ATTCTTGTTA 500 

CCTGTGGTTC TGCGTTTAAA AACAAAGGCG TACAGGCAAT GCTGGATGCG 550 

GTTATTGAGT ACCTGCCTGC ACCAACTGAC GTTGAATCAA TCAACGGCAT 600 

CTTGGATGAT GGCAAAGATA CTCCGGCTGT TCGTCATTCT GACGACAAAG 650 

AGCCGTTCTC TGCTCTGGCG TTCAAAATCG CTACCGACCC ATTCGTGGGT 700 

AACCTGACGT TCTTCCGCGT GTACTCTGGT ATTGTTAATT CCGGTGATAC 750 

CGTTCTGAAC TCAGTGAAAT CGCAACGTGA ACGCTTAGGT CGTATCGTAC 800 

AGATGCACGC TAACAAGCGT GAAGAGATCA AAGAAGTTCA CGCCGGTGAT 850 

ATCGCAGCCG CTATCGGTCT GAAAGATGTG ACTACGGGTG ACACTTTGTG 900 

TGACCCGAAT AATCCGATCA TCTTGGAACG TATGGAGTTC CCAGAGCCGG 950 

TAATCTCTGT TGCTGTTGAA CCAAAAACCA AAGCTGACCA AGAAAAAATG 1000 

GGTATGGCTC TGGGGCGTTT GGCGAAAGAA GATCCATCAT TCCGCGTTTG 1050 

GACTGACGAA GAATCTGGTC AGACTATCAT CGCTGGTATG GGTGAGTTGC 1100 

ATTTGGATAT CCTGGTTGAC CGTATGCGCC GCGAATTTAA CGTGGAAGCA 1150 

AACGTCGGTA AACCTCAGGT TGCGTACCGT GAAACTATCC GCGAAACCGT 1200 

TAAGGATGTG GAAGGTAAGC ACGCTAAGCA GTCAGGCGGT CGTGGTCAGT 1250 

ATGGTCATGT TGTTATCGAC ATGTCTCCAT TGCCACCGGG TGGTGTTGGG 1300 

TATGAATTCG TCAACGAAAT CGTTGGTGGT TCTATTCCTA AAGAATTCAT 1350 

TCCGGCCGTT GATAAAGGTA TTCAAGAACA GCTGAAATCT GGCCCTCTGG 1400 

CAGGTTACCC AGTTGTTGAC GTTAAAGTGC GTCTGCACTA CGGTTCTTAC 1450 

CATGACGTTG ACTCCTCAGA ATTGGCATTT AAATTAGCTG GTTCTATCGC 1500 

CTTTAAAGAA GGTTTCAAAC GAGCTAAACC AGTTCTGCTT GAGCCAATCA 1550 

TGAAGGTTGA AGTCGAAACC CCTGAAGATT ACATGGGTGA CGTAATGGGC 1600 

GACCTGAACC GTCGTCGCGG TATC 1624 



2) INFORMATION FOR SEQ ID NO: 2233 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1636 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Zoogloea ramigera 

(B) STRAIN: ATCC 25935 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2233 

GAACGCTCGA TGCGCGTGCT TGACGGTGCT TGCATGGTTT ACTGCGCAGT 50 

GGGCGGTGTT CAGCCACAGT CGGAAACCGT GTGGCGTCAG GCTAACAAGT 100 

ACAAAGTGCC ACGTCTGGCC TTCGTCAACA AGATGGACCG TACCGGCGCC 150 

AACTTCTTCA AGGTGTACGA GCAGATGCGT GCTCGCCTGA AGGCCAACCC 200 

GGTCCTGATC CAGATCCCTA TCGGCGCTGA AGACAACTTC AAAGGCGTGA 250 

TCGATCTGGT CAAGATGAAG GCTATCCTGT GGGACGAAGC GTCGCAAGGC 300 

ATGAAATTCG ACTACGTCGA TATTCCTGCA GAGCTGGCTG ATTCGGCCGC 350 

CGAGTGGCGC GAAAAGATGG TTGAAGCTGC TGCTGAAGCC ACCGAAGAGC 400 

TGATGAACAA GTACCTGGAA GAAGGCGACC TGACCGAAGC CGAGATCAAG 4 50 

CAGGCGCTGC GTACCCGTAC CATCGCTTCG GAAATCGTTC CGATGATGTG 500 

CGGTACCGCC TTCAAGAACA AGGGCGTACA GGCCATGCTG GACGCGGTCA 550 

TCGAATACCT GCCATCGCCA CTGGACATCG ACGATGTCGG CGGTACGGAC 600 
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GAAGACGACC AGCCAACCAC CCGTCGCGCA 
GGCGCTGGCC TTCAAGATCA TGACCGACCC 
TCTTCCGCGT GTACTCGGGC GCCGTCAATT 
TCGGTCAAAG GTCGTAAAGA GCGTCTGGGC 
GAATCAGCGC GAAGAGATCA AAGAAGTGCG 
CGGTTGGCCT GAAAGACGTG ACCACGGGCG 
GCCATCATCA CGCTGGAAAA AATGATCTTC 
GGCAGTCGAG CCAAAAACCA AGGCCGACCA 
TGAACCGCCT GGCACAGGAA GATCCTTCGT 
GAATCGGGCC AGACCATCAT CGGTGGWATG 
TATCGTTGAC CGCATGAAGC GCGAATTCGG 
AGCCACAAGT GGCTTACCGC GAAACGATCC 
GAAGGCAAGT TCGTCAAGCA ATCCGGTGGT 
TGTGCTGAAG ATCGAGCCGC AAGAACCAGG 
ACGCCATCAA GGGCGGTACC GTTCCTCGCG 
AAAGGCGTGC GCGGCACCCT GAACACCGGC 
CGTGGACGTC AAGGTCACGC TGTTCTTCGG 
CGAACGAAAA CGCGTTCCAG ATGGCCGCTT 
TGCCGCAAAG CATCGCCAGT CATTCTGGAG 
GGAAACGCCG GAAGACTACG CCGGTACCGT 
GCCGCGGTAT GGTGCAGGGC ATGGACGAAA 



PCT/CA00/01150 

GCTGACGACG AGAAATTCTC 650 
GTTCGTCGGT CAATTGGCCT 700 
CGGGCGACAC CGTGTACAAC 750 
CGTATTCTGC AGATGCACGC 800 
CGCCGGCGAC ATCGCCGCTG 850 
AAACCCTGTG CGATCCGACC 900 
CCTGAGCCTG TGATTCAACA 950 

GGAAAAAATG GGCCTGGCAC 1000 

TCCGCGTGAA GACCGATGAA 1050 

GGCGAGCTGC ACCTGGAAAT 1100 

CGTGGAAGCA ACCGTCGGCA 1150 

GTAAAACCTG CGAAGAATCG 1200 

CGTGGTCAAT ACGGTCACGT 1250 

CAAGGGCTTC GAGTTCGTTG 1300 

AGTACATCCC TGCGGTGGAA 1350 

GTGCTGGCTG GTTACCCGGT 1400 

TTCGTACCAC GATGTGGACT 1450 

CGATGGCATT CAAAGAAGGC 1500 

CCAATGATGG CTGTGGAAGT 1550 

GATGGGCGAC CTGTCGTCCC 1600 

TCCCAG 1636 



2) INFORMATION FOR SEQ ID NO: 2234 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1656 bases 

(B) TYPE: Nucleic acici 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aeromonas hydrophxla 

(B) STRAIN: ATCC 7966 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2234 

GGCACGAACG ATCTTGCCGG ACGGGCCATC TTCCATACCT TCGATCAGGC 50 

CACGACGACG GTTCAGGTCA CCGATCACGT CGCCCATATA GTCTTCCGGA 100 

GTCTCGACTT CTACTTTCAT GATCGGTTCG AGCAGAACCG GGTTGGCCTT 150 

CATGAAGCCA GCCTTAAAGG CCATGGAAGC AGCGATCTTG AACGCCAGTT 200 

CGGAAGAGTC GACATCGTGG TAGGAACCGA AGTGCAGACG CACGCCCAGA 250 

TCCATAACCG GATAACCTGC CAGCGGGCCG GACTTCAGTT GCTCGCGGAT 300 

ACCCTTGTCA ACACCCGGGA TGAACTCACC AGGAATGACG CCGCCCTTGA 350 

TGTCGTTGAC GAATTCGTAG GCTTTGCCTT CTTCCAGCGG GTACATGTCG 4 00 

ATCACAACGT GACCGTACTG ACCACGACCA CCGGACTGCT TGGCGTGCTT 450 

ACCTTCGATA TCCTTGACGG TGTTACGAAT GGTTTCACGG TAGGCAACCT 500 

GCGGCTTACC TACGTTCGCT TCTACCTTGA ACTCGCGACG CATACGGTCA 550 

ACGATGATGT CCAGGTGCAG CTCACCCATA CCGGCGATGA TGGTTTGGCC 600 

AGACTCTTCG TCAGTCCATA CGCGGAAGGA CGGGTCTTCC TGAGCCAGAC 650 

GGCCCAGAGC CAGGCCCATC TTCTCTTGGT CAGCCTTGGT TTTCGGCTCA 700 

ACTGCGATGG AGATTACCGG TTCCGGGAAT TCCATACGCT CGAGGATGAT 750 

CGGCGCTTTT TCGTCACACA GGGTGTCACC GGTGGTCACG TCTTT CAGAC 800 

CAATGGCGGC AGCGATGTCG CCTGCGCGAA CTTCTTTGAT CTCTTCACGC 850 
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TTGTTGGCGT GCATCTGAAC GATACGGCCA AAACGCTCGC GCTTCTCTTT 900 

AACGGAGTTC AGCACGGAGT CACCGGAGTT AACCACACCG GAGTAAACGC 950 

GGAAGAAGGT CAGGTTGCCT ACGAACGGGT CGGTAGCAAT CTTGAATGCC 1000 

AGAGCAGCAA ACGGCTCGTC ATCAGAAGCA TGACGCTCGT CTTTGGTCTC 1050 

GCCATCCAGC TTCAGACCGT CGATGGCTGC TACGTCGGTC GGCGCCGGCA 1100 

GATAGTCAAC CACGGCATCC AGCATGGCCT GTACGCCCTT GTTCTTGAAC 1150 

GCGGAGCCAC AGGTAACCAG GATGATTTCG TTGTTCAGAA CACGCTGACG 1200 

AAGAGCTTTC TTGATCTCTT CCTCGGTCAG TTCTTCACCA CCCAGGTATT 1250 

TTTCCATCAG GTCTTCAGAC GCTTCAGCAG CGGCTTCAAC CAGGGTCATG 1300 

CGCATTTCTT GCGCTTTTTC CAGCAGCTCA GCCGGGACGT CTTCGTAATC 1350 

GAAGGATACG CCCTGGTCAG CTTCGCTCCA GTTGATGGCT TTCATCTTGA 1400 

CCAGGTCGAT AACGCCCTTG AAGTTCTCTT CTGAACCGAT GTTCAGTTGC 1450 

AGCGGAACCG GGTTACCTTT CAGACGGGTC TTGATGTGCT CAACGCAGCG 1500 

CAGGAAGTTG GCACCGGTAC GGTCCATCTT GTTGACGAAC GCGATACGGG 1550 

GAACCTTGTA CTTGTTAGCC TGACGCCATA CGGTTTCAGA CTGTGGCTGT 1600 

ACGCCACHTA CGGCACAGTA CACCATCACG GCGCCGTCCA GAACACGCAT 1650 

GGAACG 1656 



2) INFORMATION FOR SEQ ID NO: 2235 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 155 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia adiacens 

(B) STRAIN: ATCC 49175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2235 

TCTCCTCCCC ATTTGATAAC TACCAAATGA ACGCTATCGA CTGGTTATGT 50 

CAGTCATAAC CAGTTGATTT TTAAGAGAGT TCTTTGGTAT AATTACAATC 100 

GGTAGATACT GTTATAGAAT CTAACAAAAC TCAATTAATA GGAGGAATCA 150 

TTTAA 155 



2) INFORMATION FOR SEQ ID NO: 2236 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetobacter baumannii 

(B) STRAIN: ATCC 19606 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2236 

TCTTTCGATT ACTATAAGCC CTAACTAATT CTTAGTTAAA AACCAAGTGC 50 
TCATGGAGCG ACCCTCCATG AGTAGTTAAT AAAGGAAGAT CATC 94 



2) INFORMATION FOR SEQ ID NO: 2237 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Actinomyces meyeri 

(B) STRAIN: ATCC 35568 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2237 

GTCCCACCGG GGGTGAGTGG GGGCCGGGCA ATCGGCCCCT GCGAGCCGCC 50 
TCACACACCA GTAATCCCAG TAGGTCTCAT GCCCCATGGG TGTTAAAGTG 100 
ACACCTAGCC GTAGGCTGAG AATTTCTACC CGAGTCCAGG AGGACGAAAA 150 



2) INFORMATION FOR SEQ ID NO: 2238 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

• (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2238 
TTCAGTCCTT TAGGCAAGGA GTTAATTGTC 30 



2) INFORMATION FOR SEQ ID NO: 2239 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium diphtheriae 

(B) STRAIN: ATCC 27010 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2239 

CAACTAGCCT CTCGCTTATC TCCCTATGCA AAATTTCTAG TCAGTTACAG 50 

GGCATAGGGG GAGCGTAGGC GGGGGTAGCG GCTTGCTGAG CACTTCCTCT 100 

ACATCAAAGG GAATGTTGAG CCGGCCGTTA CCCTGTACGA TCCCATCTGG 150 

TTTCTTCGGT GGTTTGATAA ATACCCCGTT GTGACCCTAG GATCATGTAA 200 

CTGGCACAAT GTAAATAGCT GTACTGCCAG GCTGCCGAAT TAGCAGTCAG 250 

AAATGTACAG CACTGTCAAC TCGTGGCTGC GAAATCGTAG CCACCACGAA 300 

GTCCAGGAGG ACACACA 317 



2) INFORMATION FOR SEQ ID NO: 2240 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: En terobacter cloacae 

(B) STRAIN: ATCC 13047 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2240 

TCCACAGGAT TAAAACCTAA GTCCCGTGCT CTCTCCTGAG GGGAGAGCAC 50 
TATAGTAAGG AATATAGCC 69 



2) INFORMATION FOR SEQ ID NO: 2241 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 69 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2241 

TCCACAGGAT TAAAACCTAA GTCCCGTGCT CTCTCCGAAG GGGAGAGCAC 50 
TATAGTAAGG AATATAGCC 69 
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2) INFORMATION FOR SEQ ID NO: 2242 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

(B) STRAIN : ATCC 15313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2242 

AACAACAAAG AAGATTAATT GATTTTTTTC GCAACATCAA GTATAACTTT 50 
AGTTAGAAGT ATTACTTAGT TTAAATTTAA GCTAAGTAAA AAATAATTAT 100 
CGAATTATCG AGGAGGATAT TTTAAA 126 



2) INFORMATION FOR SEQ ID NO: 2243 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 170 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium avium 

(B) STRAIN: ATCC 25291 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2243 

GTGTCAACTC ACTGGCTCGG AGCCGAGCAA TCGGCTCAGC GAAGGCGACG 50 

GGTCAGTAGC TACTGGCAGC GGAGTAATCT TGCCGGGTCA TTGGAATGCC 100 

TTGGGCGCGG CACAACTGAA AACACCAACA CTGCTTTAAC AAGCACCAAC 150 

TAGTCCAGGA GGACACAGAA 170 



2) INFORMATION FOR SEQ ID NO: 2244 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 103 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium gordonae 

(B) STRAIN: Mgor-1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2244 



GGCCGGCAAG CCTGCGAGTA AGCTGACGCG GTTAGCACCG CGGCAAAACC 
AAGAAAAATC AACACTGCTT TTTTAAGCAC CAACAGTCCA GGAGGACAAC 
AAA 



50 
100 
103 



2) INFORMATION FOR SEQ ID NO: 2245 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 101 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium kansasii 

(B) STRAIN: Mkan-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2245 

GCCGAGCGTT GCGCGTAAGC TAGCTCGGTT ACCACGGCGG CAAAACTAGA 50 
AAAACATCAA CACTGCTTTT ATAAGCACCA ACAGTCCAGG AGGACACAGA 100 
A 101 



2) INFORMATION FOR SEQ ID NO: 2246 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 91 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium terrae 

(B) STRAIN: Mter-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2246 

CCAGCTTCCC GCCCGTGCCG CTACGGTGGC ACAAACCCAA AAAGATCAAT 50 
CCTGCTGTAA CCCAGCACCA ACAAGTCCAG GAGGACAAGA A 91 



2) INFORMATION FOR SEQ ID NO: 2247 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 43768 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2247 

TCAGGCAAAT AGGCCGTCTG AAAGGCTGAA ATGATTTTTC AGACGGCATT 50 
GTTCTTTAAT CGATCTTTAA TGTAAAGGAA TTAGCTC 87 



2) INFORMATION FOR SEQ ID NO: 2248 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 14990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2248 

TATAACTTGT TAAGACTAGC TATGCTAGGT TAAAATACAG GTTGAGCTTA 50 

TTTATAAGCT GACATTTTTA TGATTTGATT TTTAGGGGTA AATGCATTAT 100 

AAAAGAATTA TAAATTCTTT TATGCTACAC TCAATCAATT TTCTTCTCAT 150 

GATGGTGAGA AACTATCATG AGAGATAAAT TTGAAATAAC TTTTATTAAG 200 

AATAGGAGAG ATTTAATA 218 



2) INFORMATION FOR SEQ ID NO: 2249 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 
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(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 2249 



TTAAACTTGT TTTAGCTAGA ATTTCTAGGT AAAATACAGC GTAAGCTTAT 
TAATTAAGCT AACATCTTTA TGAATTGATT TTTTACTGAA AATGCATTAT 
AAATGAATTA TGAATTCTAA CAATCATTAT GTCTCATGAT GGTGAGAAAC 
TATCATGAGA GATAATATTG AAATAACTTT TACTAGAATA GGAGAGATTT 
AATA 



50 
100 
150 
200 
204 



2) INFORMATION FOR SEQ ID NO: 2250 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2250 
CCGTCGGATG GTGTCGTATA CCGCGGAGTC GCCGACGG 38 



2) INFORMATION FOR SEQ ID NO: 2251 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2251 
CGGAGCCGTT CTCGCTGCGT TACATGCTGG TGGCTCCG 38 



2) INFORMATION FOR SEQ ID NO: 2252 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2252 
CCACATACAG TGTCTCTC 18 
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2) INFORMATION FOR SEQ ID NO: 2253 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2253 
CATTACCCAA CCGAAAGTA 19 



2) INFORMATION FOR SEQ ID NO: 2254 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) - STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2254 
ACCTGAACAG AGAGAAATG 19 



2) INFORMATION FOR SEQ ID NO: 2255 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Abiotrophia adiacens 

(B) STRAIN: ATCC 49175 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2255 

ATGGCAAAAG AAAAATTTGA CCGTTCAAAA CCACACGTTA ACATTGGTAC 50 

AATCGGCCAC GTTGACCACG GTAAAACAAC ATTAACTGCT GCTATCACAA 100 

CTGTTTTAGC TAAGAAAGGT TTCGCGCAAG CTCAAGATTA CGGTTCAATC 150 

GATAAAGCTC CAGAAGAACG CGAACGTGGT ATCACAATCA ACACTTCTCA 200 

CGTTGAGTAC GAAACAGACA CTCGTCACTA TGCTCACGTT GACTGCCCAG 250 

GACACGCGGA CTACGTTAAA AAC 273 
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2) INFORMATION FOR SEQ ID NO: 2256 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Acinetojbacter baumannii 

(B) STRAIN: ATCC 19606 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2256 



ATGGCTAAAG CCAAGTTTGA 
AATCGGTCAC GTTGACCATG 
CAATTTGTGC AAAAACTTAC 
GACTCAGCAC CTGAAGAAAA 
CGTAGAATAC GATTCTCCAA 
GCCACGCCGA CTACGTTAAA 



ACGTAATAAA CCACACGTAA 
GTAAAACAAC TTTAACTGCT 
GGCGGTGAAG CGAAAGATTA 
AGCACGTGGT ATTACAATTA 
CTCGTCACTA CGCACACGTT 
AAC 



ACGTGGGTAC 


50 


GCGATTGCAA 


100 


CTCACAAATC 


150 


ATACATCACA 


200 


GACTGCCCAG 


250 . 




273 



2) INFORMATION FOR SEQ ID NO: 2257 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Actinomyces meyeri 

(B) STRAIN: ATCC 35568 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2257 



GTGGCGAAGG CCAAGTTTGA GCGCACCAAG 
GATTGGTCAC GTTGACCACG GCAAGACGAC 
AGGTGCTGCA TGACAAGTAC CCCGAACTGA 
CAGGTCGACA ACGCTCCCGA GGAGCGCGAT 
CTCTCACGTT GAGTACCAGA CCGAGGCGCG 
CTCCCGGCCA CGCCGACTAC GTCAAGAAC 



CCGCACGTCA ACATCGGCAC 50 
GCTGACGGCA GCTATCACCA 100 
ACGAGTTCAC CCCCTTCGAT 150 
CGTGGCATCA CGATCAACGT 200 
TCACTACGCG CACGTTGACG 250 

279 



2) INFORMATION FOR SEQ ID NO: 2258 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 273 bases 

1158 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Clostridium difficile 

(B) STRAIN: ATCC 9689 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2258 



GTGGCTAAAG AAAAATTTGA TCGTTCCCTA 
TATCGGTCAC GTTGACCATG GTAAAACCAC 
GCGTTTGCTC CGAAGTATTC GGTTCCGCAA 
GACAGCGCAC CAGAAGAAAA AGCTCGTGGT 
CGTTGAATAC AACTCGCTGA TCCGTCACTA 
GTCACGCTGA CTATGTGAAG AAC 



CCGCACGTCA ACGTTGGCAC 50 
TCTGACTGCT GCTCTGACTC 100 
TCGTTGATTT CGATAAAATC 150 
ATCACCATCA ACACCGCGCA 200 
CGCTCACGTT GACTGCCCAG 250 

273 



2) INFORMATION FOR SEQ ID NO: 2259 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Corynebacterium diphtheriae 

(B) STRAIN: ATCC 27010 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2259 

GTGGCAAAGG CTAAGTTCGA GCGTACCAAG CCGCACGTCA ACATCGGCAC 50 

CATCGGTCAC GTTGACCACG GTAAGACCAC CACCACCGCT GCTATCACCA 100 

AGGTTTTGGC AGACGCTTAC CCAGAGCTGA ACGAAGCTTT CGCTTTCGAT 150 

GCCATCGATA AGGCACCGGA AGAGAAAGAG CGTGGTATTA CCATCAACAT 200 

CTCCCACGTG GAGTACCAGA CCGAGAAGCG CCACTACGCA CACGTTGACG 250 

CTCCAGGTCA CGCTGACTAC ATCAAGAAC 279 



2) INFORMATION FOR SEQ ID NO: 2260 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 
(vi) ORIGINAL SOURCE: 
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(A) ORGANISM: En terobacter cloacae 

(B) STRAIN: ATCC 13047 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2260 

GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTCA ACGTTGGTAC 50 

TATCGGCCAC GTTGACCATG GTAAAACTAC CCTGACTGCT GCAATCACTA 100 

CCGTTCTGGC TAAAACCTAC GGTGGTTCTG CTCGTGCATT CGACCAGATC 150 

GATAACGCAC CAGAAGAAAA AGCTCGTGGT ATCACCATCA ACACCTCTCA 200 

CGTTGAATAT GACACCCCGA CTCGCCACTA CGCACACGTA GACTGCCCAG 250 

GTCACGCCGA CTATGTTAAA AAC 273 



2) INFORMATION FOR SEQ ID NO: 2261 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Klebsiella pneumoniae subsp. pneumoniae 

(B) STRAIN: ATCC 13883 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2261 

GTGTCTAAAG AAAAATTTGA ACGTACAAAA CCGCACGTCA ACGTTGGTAC 
TATCGGCCAC GTTGACCATG GTAAAACTAC CCTGACTGCT GCCATCACTA 
CCGTTCTGGC TAAAACCTAC GGTGGTTCCG CTCGCGCATT CGACCAGATC 
GATAACGCGC CGGAAGAAAA AGCTCGTGGT ATCACCATCA ACACCTCTCA 
CGTTGAATAT GACACCCCGA CTCGCCACTA CGCGCACGTA GACTGCCCGG 
GCCACGCCGA CTATGTTAAA AAC 



50 
100 
150 
200 
250 
273 



2) INFORMATION FOR SEQ ID NO: 2262 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Listeria monocytogenes 

(B) STRAIN: ATCC 15313 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2262 

ATGGCAAAAG AAAAATTTGA CCGCTCTAAA CCCCATGTTA ACATTGGTAC 50 
TATTGGACAC GTTGACCATG GTAAAACAAC TTTAACTGCT GCAATTACAA 100 
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CTGTACTTGC TAAAAAAGGC TATGCTGATG CACAAGCTTA TGACCAAATT 150 

GATGGTGCTC CAGAAGAAAG AGAACGTGGA ATCACAATCT CTACTGCTCA 200 

CGTTGAGTAC CAAACTGACA GCCGTCACTA TGCACACGTT GACTGCCCAG 250 

GACATGCCGA TTACGTTAAA AAC 273 



2) INFORMATION FOR SEQ ID NO: 2263 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium avium 

(B) STRAIN: ATCC 25291 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2263 



GTGGCGAAGG CGAAGTTCGA GCGGACGAAG CCGCACGTCA ACATCGGGAC 50 

CATCGGTCAC GTTGACCACG GCAAGACCAC GCTGACCGCG GCTATCACCA 100 

AGGTTCTGCA CGACAAGTAC CCGGACCTGA ACGAGTCCCG CGCGTTCGAC 150 

CAGATCGACA ACGCGCCCGA GGAGCGT C AG CGCGGTATCA CCATCAACAT 200 

CTCCCACGTG GAGTACCAGA CCGACAAGCG GCACTACGCT CACGTCGACG 250 

CCCCGGGTCA CGCCGACTAC ATCAAGAAC 279 



2) INFORMATION FOR SEQ> ID NO: 2264 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium gordonae 

(B) STRAIN: Mgor-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2264 



GTGGCGAAGG CGAAGTTCCA GCGGACCAAG CCGCACGTCA ACATCGGGAC 50 

CATCGGTCAC GTTGACCACG GCAAGACCAC TCTGACCGCG GCTATCACCA 100 

AGGTCCTGCA CGACAAGTAC CCGGACCTGA ACGAGTCCAA GGCGTTCGAC 150 

CAGATCGACA ACGCGCCTGA GGAGCGTCAG CGCGGTATCA CGATCAACAT 200 

CGCGCACGTG GAATACCAGA CCGAGAAGCG TCACTACGCG CACGTCGACG 250 

CCCCCGGCCA CGCCGACTAC ATCAAGAAC 279 



2) INFORMATION FOR SEQ ID NO: 2265 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 9 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium kansasii 

(B) STRAIN: Mkan-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2265 



GTGGCGAAGG CGAAGTTCCA GCGGACCAAG 
CATCGGTCAC GTTGACCACG GCAAGACCAC 
AGGTCCTGCA TGACAAGTTC CCGGACCTGA 
CAGATCGACA ACGCTCCTGA GGAGCGCCAG 
CGCGCACGTG GAGTACCAGA CCGAGAAGCG 
CGCCGGGCCA CGCCGACTAC ATCAAGAAC 



CCCCACGTCA ACATCGGGAC 50 
CCTGACCGCG GCTATCACCA 100 
ACGAGTCGAA GGCGTTCGAC 150 
CGCGGTATCA CGATCAACAT 200 
GCACTATGCA CACGTCGACG 250 

279 



2) INFORMATION FOR SEQ ID NO: 2266 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 279 bases 

(B) TYPE: Nucleic acid 

(C) . STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mycobacterium terrae 

(B) STRAIN: Mter-1 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2266 



GTGGCGAAGG CGAAGTTCGA GCGGACGAAG 
CATCGGTCAC GTTGACCACG GCAAGACCAC 
AGGTTCTGCA CGACAAGTAC CCGGACCTCA 
CAGATCGACA ACGCTCCCGA AGAGCGTCAG 
CTCCCACGTG GAGTACCAGA CCGAGAAGCG 
CTCCTGGTCA CGCTGACTAC ATCAAGAAC 



CCGCACGTCA ACATCGGGAC 50 
GCTGACCGCG GCTATCACCA 100 
ACGAGTCGCG TGCGTTCGAC 150 
CGCGGTATCA CCATCAACAT 200 
GCACTACGCC CACGTCGACG 250 

279 



2) INFORMATION FOR SEQ ID NO: 2267 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Neisseria polysaccharea 

(B) STRAIN: ATCC 43768 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2267 

ATGGCTAAGG AAAAATTCGA ACGTAGCAAA CCGCACGTAA ACGTTGGCAC 50 

CATCGGTCAC GTTGACCATG GTAAAACCAC TCTGACTGCT GCTTTGACTA 100 

CTATTTTGGC TAAAAAATTC GGCGGTGCTG CAAAAGCTTA CGACCAAATC 150 

GACAACGCAC CCGAAGAAAA AGCACGCGGT ATTACCATTA ACACCTCGCA 200 

CGTAGAATAC GAAACCGAAA CCCGCCACTA CGCACACGTA GACTGCCCGG 250 

GTCACGCCGA CTACGTTAAA AAC 273 



2) INFORMATION FOR SEQ ID NO: 2268 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus epidermidis 

(B) STRAIN: ATCC 14990 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2268 

ATGGCAAAAG AAAAATTTGA TCGCTCAAAA GAACATG CCA ATATTGGTAC 50 

TATCGGTCAC GTTGACCATG GTAAAACAAC TTTAACAGCT GCTATCGCAA 100 

CTGTATTAGC TAAAAATGGT GACACTGTTG CACAATCATA CGATATGATT 150 

GACAACGCTC CAGAAGAAAA AGAACGTGGT ATTACAATCA ATACTGCACA 200 

TATCGAATAC CAAACTGACA AACGTCACTA TGCTCACGTT GACTGCCCAG 250 

GACACGCTGA CTATGTTAAA AAC 273 



2) INFORMATION FOR SEQ ID NO: 2269 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Staphylococcus haemolyticus 

(B) STRAIN: ATCC 29970 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2269 
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ATGGCAAAAG AAAAATTTGA TCGCTCAAAA 
TATCGGTCAC GTTGACCATG GTAAAACTAC 
CTGTATTAGC TAAAAATGGT GACACTGTAG 
GACAACGCTC CAGAAGAAAA AGAACGTGGT 
CATCGAGTAT CAAACTGACA AACGTCACTA 
GACACGCTGA CTATGTTAAA AAC 
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GAAGATGCCA ATATTGGTAC 50 
TTTAACAGCT GCTATCGCAA 100 
CACAATCATA TGACATGATT 150 
ATCACAATCA ATACTGCACA 200 
TGCTCACGTT GACTGCCCAG 250 

273 



2) INFORMATION FOR SEQ ID NO: 2270 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

<vi) ORIGINAL SOURCE: 

(A) ORGANISM: Aeromonas hydrophila 

(B) STRAIN: ATCC 7966 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2270 



CGATCCTGGT AGTAGCAGCG ACTGACGGCC 
CACATCCTGC TGGGTCGTCA GGTAGGCGTT 
GAACAAGTGT GACATGGTAG ATGACGAAGA 
TGGAAGTTCG CGAACTGCTG TCCGAGTACG 
CCGGTAGTCC GTGGTTCYGC ACTGAAAGCG 
GGAAGAGAAG ATCCTGGAAC TGGCTGGCCA 
AGCCGGAGCG TGCCATCGAC CTGCCGTTCC 
TTCTCCATCG CTGGCCGYGG TACCGTAGTG 
TATCGTCAAA GTTGGTGAAG AAGTGGAAAT 
CCAAGACCAC CTGTACCGGC GTTGAAATGT 
GGTCGTGCAG GCGAGAACAT CGGTGCACTG 
AGACGTAGAG CGTGGTCAGG TACTGGCCAA 
ACACCAAGTT YGAATCTGAA GTGTACGTGC 
CGTCATACCC CGTTCTTCAA AGGCTACCGT 
TACCGACGTG ACCGGTACCA TCGAACTGCC 
TGCCGGGCGA CAACATCAAG ATGGTTGTTA 
ATGGACGACG GC 



CGATGCCGCA GACTCGTGAG 50 

CCGTACATCA TCGTGTTCAT 100 

GCTGCTGGAA CTGGTCGAGA 150 

ACTTCCCGGG TGATGACCTG 200 

CTGGAAGGCG AAGCTCAGTG 250 

CCTGGACACC TACATTCCGG 300 

TGATGCCTAT CGAAGACGTA 350 

ACCGGTCGTG TAGAGCGCGG 400 

CGTKGGTATC AAAGATACCA 450 

TCCGCAAACT GCTGGACGAA 500 

CTGCGTGGCG TGAAGCGTGA 550 

GCCGGGCACC ATCAAGCCGC 600 

TGTCCAAAGA AGAAGGTGGT 650 

CCGCAGTTCT ACTTCCGTAC 700 

GGAAGGCGTA GAGATGGTAA 750 

CCCTGATTGC GCCGATCGCG 800 

812 



2) INFORMATION FOR SEQ ID NO: 2271 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 799 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Bilophila wadsworthia 
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(B) STRAIN: ATCC 4 9260 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2271 

CGACGGTCCC ATGCCCCAGA CCCGTGAGCA CATCCTGCTC GCCCGTCAGG 50 

TCGGCGTGCC TCACCTCGTC GTGTTCATGA ACAAGTGTGA CCTCGTCGAC 100 

GACCCCGAAC TGCTCGAACT CGTCGAAATG GAAGTCCGCG AACTGCTGAG 150 

CTCCTACGGC TACCCCGGCG ATGAAATCCC GGTTGTCCGC GGTTCCGCTC 200 

TGAAGGCTCT GGAATCCGAT AGCGCTGATT CCCCTGACGC CCAGTGCGTG 250 

CTCGAACTGC TCGCCGCTTG CGACAGCTAC TTCCCGGATC CGGTCCGCGA 3 00 

AACCGACAAG CCCTTCCTGA TGCCCATCGA AGACGTGTTC TCCATCTCCG 350 

GCCGCGGTAC CGTGGTCACC GGTCGTGTGG AACGTGGCAT CATCAAGGTC 4 00 

GGCGAAGAAG TCGAAATCGT GGGTATCCGT CCCACCGTGA AGACGACCTG 450 

CACCGGCGTC GAAATGTTCC GCAAGCTGCT CGATCAGGGC CAGGCCGGCG 500 

ACAACATCGG CGCTCTGCTC CGCGGCACGA AGCGTGACGA AGTGGAACGC 550 

GGCCAGGTTC TCGCCGCTCC CAAGAGCATC ACGCCCCACA AGAAGTTCAA 600 

GGCTGAAGTG TACGTTCTGT CCAAGGAAGA AGGCGGCCGC CATACCCCGT 650 

TCTTCACCGG CTATCGTCCT CAGTTCTACT TCCGTACCAC CGACATCACC 700 

GGTATCATCG CTCTTGAAGA AGGCGTTGAA ATGGTTATGC CCGGCGATAA 750 

CGCTACCTTT AATGTCGAGC TCATTCACCC CATCGCCATG GAAAAGGGC 799 



2) INFORMATION FOR SEQ ID NO: 2272 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 786 bases 

(B) . TYPE: Nucleic acid- 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Brevundimonas diminuta 

(B) STRAIN: ATCC 11568 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2272 

GATCCTGGTG TGCTCGGCCG CTGACGGCCC GATGCCGCAG ACCCGCGAGC 50 

ACATCCTGCT GTCGCGTCAG GTCGGCGTTC CGGCGCTGGT GGTGTTCCTG 100 

AACAAGGTCG ACATGGTCGA CGACGAGGAR CTGCTGGAGC TGGTCGAGAT 150 

GGAAGTGCGC GAGCTGCTGA GCTCGTACCA GTTCCCGGGC GACGACATCC 2 00 

CGGTGGTCAA GGGCTCGGCC CTGGCCGCGG TGGAGGATCG CGACCCGCAC 250 

ATCGGCGCCG AGCGCGTTCT GGAGCTGATG GCGGCGGTCG ACAGCTACAT 3 00 

CCCGCAGCCG GAACGTCCGA TCGACATGCC GTTCCTGATG CCGGTGGAAG 3 50 

ACGTGTTCTC GATCTCGGGC CGCGGCACCG TGGTGACGGG TCGCGTCGAG 400 

* CGCGGCGTCG TCAAGGTCGG TGAAGAAGTC GAAATCGTCG GCATCCGTCC 450 

GGTTCAGAAG ACGACCTGCA CGGGCGTCGA AATGTTCCGC AAGCTGCTGG 500 

AYCAGGGTCA RGCCGGCGAC AACGTGGGCG TGCTGCTGCG CGGCACCAAG 550 

CGTGAAGACG TCGAGCGCGG CCAGGTGCTG TGCAAGCCGG GTTCGATCAC 600 

CCCGCACACC AAGTTCGTGG CTGAAGCCTA CATCCTGAAC AAGGAAGAAG 650 

GCGGCCGTCA CACGCCGTTC TTCACGAACT ACCGTCCGCA GTTCTACTTC 700 

CGCACGACGG ACGTGACCGG CATCGTGCGC CTGAAGGAAG GCGTCGAGAT 750 

GATCATGCCG GGCGACAACG CCGAGCTGGA CGTCGA 786 
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2) INFORMATION FOR SEQ ID NO: 2273 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) . STRAIN: LSPQ 2583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2273 

GCTATTATGG CTGGATTACT ATCCAATACG CTATCCAAGA ATCCCGTAAC 50 

GTACCTGCCG TCAAATCGCT GGAAGCAGTC GGATTAGATA ATTCATTGAA 100 

ATTCCTCAAT GGCCTTGGTA TCAATTATCC TGAGATGCAT TATTCTAATG 150 

CGATTTCAAG TAATACAAGC GAATCTGGTA ACCAATACGG AGCAAGTAGC 200 

GAAAAAATGG CTGCCGCTTA CGCTGCCTTT GCTAATGGCG GTACATATTA 250 

CAAACCGCAA TACGTCAACC GAGTTGTCTT TAGCGACGGT ACAGAAAAAG 300 

TCTTTTCAAA TGGCGGATCA AAAGCCATGA AAGAGACGAC AGC CTACATG 350 

ATGACAGACA TGATGAAGAC CGTTCTTCAG TCTGGAACTG GTACCAATGC 400 

TGCAATCCCA GGAGTTTATC AAGCAGGTAA AACTGGTACT TCCAACTATG 450 

CAGATGATGA GCTAGAGAAG TTGACAAAAC CTTATTACAG TTCTAGCATT 500 

GTCACACCAG ACGAACTATT TGTTGGCTAT ACTCCACAAT ACTCTATGGC 550 

TGTTTGGACA 56 0 

2) INFORMATION FOR SEQ ID NO: 2274 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 49456 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2274 

GCTACTTTGG AAACATTACT GTCCAATATG CTCTTCAACA ATCACGTAAT 50 

GTCACAGCCG TTGAAACTTT GAATAAGGTC GGTCTAGATA AGGCTAAAGC 100 

CTTCCTTAAT GGGCTTGGTA TTGATTATCC AAGCATGCAT TATGCAAACG 150 

CCATTTCAAG TAATACAACT GAATCCAACA AAAAATACGG TGCAAGTAGT 200 

GAAAAAATGG CTGCTGCCTA CGCTGCTTTT GCTAATGGTG GTATTTACCA 250 

CAAGCCAATG TACATCAATA AAATCGTCTT TAGCGACGGT AGTGAGAAAG 300 

AATTTTCTGA TGCCGGCACA CGAGCTATGA AAGAAACTAC TGCCTATATG 350 

ATGACTGAAA TGATGAAAAC AGTCCTAGTA TACGGTACCG GACGTGGAGC 400 

CTACCTACCA TGGCTTCCAC AAGCAGGTAA GACAGGTACT TCTAACTATA 450 

1166 



SUBSTITUTE SHEET (RULE 26) 



WO 01/23604 



PCT/CA00/01150 



CTGACGACGA AATTGAAAAG TATATCAAGA ACACTGGCTA CGTAGCCCCA 500 
GATGAAATGT TTGTAGGGTA TACTCGTAAA TATGCAATGG CTGTTTGGAC 550 
A 551 



2) INFORMATION FOR SEQ ID NO: 2275 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 560 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus mitis 

(B) STRAIN: ATCC 903 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2275 



GCTATTATGG CTGGATTACT ATCCAATACG 
GTACCAGCCG TCAAATCGCT GGAAGCAGTC 
GTTCCTCAAT GGCCTTGGTA TTAATTACCC 
CGATTTCAAG TAATACAAGC GAATCTGGTA 
GAAAAAATGG CTGCCGCTTA CGCTGCCTTT 
CAAACCGCAA TACGTCAACC GAGTTGTCTT 
TCTTTTCAAA TGGCGGATCA AAAGCCATGA 
ATGACAGACA TGATGAAGAC CGTTCTTCAA 
TGCAATTCCA GGAGTCTATC AAGCAGGTAA 
CAGATGATGA ACTAGAGAAG TTGACAAAAC 
GTCACACCAG ACGAGCTGTT TGTTGGCTAC 
TGTTTGGACA 



CTATCCAAGA ATCCCGTAAC 50 

GGATTAGATA ATTCATTGAA 100 

TGAAATGCAT TATTCTAATG 150 

AC CAATACGG AGCAAGTAGC 2 00 

GCTAATGGCG GTACATATTA 250 

TAGCGACGGT ACAGAAAAAG 3 00 

AAGAAACGAC AGCCTACATG .350 

TCTGGAACTG GTACCAATGC 400 

AACCGGCACT TCCAACTATG 450 

CTTATTACAG TTCTAGCATT 500 

ACTCCACAGT ACTCTATGGC 550 

560 



2) INFORMATION FOR SEQ ID NO: 2276 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 



(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Streptococcus oralis 

(B) STRAIN: ATCC 35037 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2276 



TTATTTTGGA AATATCACCA TCCAATATGC GCTCCAACAA TCACGGAACG 50 

TTACAGCCGT AGAAACCTTG AACAAAGTCG GTTTGGATAG AGCCAAGACC 100 

TTCCTGAATG GAATCGGTAT TGACTATCCA GATATGCACT ATGCCAACGC 150 

GATTTCAAGT AATACGACTG AGTCAAACAA AAAGTACGGA GCAAGTAGTG 200 
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AGAAAATGGC TGCTGCTTAC GCTGCTTTTG 
AAACCAATGT ATATCAACAA AATCGTCTTT 
ATACGCTGAT CCTGGTACTC GTGCCATGAA 
TGACAGAAAT GATGAAGACT GTCTTGGCAT 
TATCTCCCTT GGCTACCTCA AGCTGGTAAG 
AGATGATGAA ATTGAAAACT ACATCAAAAA 
ACGAAATGTT TGTTGGTTAT ACTCGCAAAT 
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CTAACGGTGG TATCTACCAT 250 

AGCGATGGTA GCTCAAAAGA 300 

AGAGACGACC GCCTATATGA 350 

ACGGAACGGG TCGTGGTGCT 400 

ACTGGTACAT CAAACTATAC 450 

TACTGGTTAT GTAGCCCCAG 500 

ATTCAATGGC TGTWTGGACA 550 



2) INFORMATION FOR SEQ ID NO: 2277 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 356 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia 

(B) STRAIN: ATCC 35401 

(xi) SEQUENCE DESCRIPTION: SEQ 

GCTCCTATCT GGATTATGCG ATGTCGGTCA 
GATGTCCGAG ATGGCCTGAA GCCGGTACAC 
GAACGTACTA GGCAATGACT GGAACAAAGC 
TCGTTGGTGA CGTAATCGGT AAATACCATC 
TATGACACGA TCGTCCGCAT GGCGCAGCCA 
GGTAGACGGT CAGGGTAACT TCGGTTCTAT 
CAATGCGTTA TACGGAAATC CGTCTGGCGA 
GCCGAT 



coli 



ID NO: 2277 

TTGTTGGCCG TGCGCTGCCA 50 

CGTCGCGTAC TTTACGCCAT 100 

CTATAAAAAA TCTGCCCGTG 150 

CCCATGGTGA CTCGGCGGTC 200 

TTCTCGCTGC GTTATATGCT 250 

CGACGGCGAC TCTGCGGCGG 300 

AAATTGCCCA TGAACTGATG 350 

356 



2) INFORMATION FOR SEQ ID NO: 2278 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 347 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 23511 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2278 

CTCTTATCTG GATTATGCGA TGTCGGTCAT TGTTGGCCGC GCGCTGCCGG 50 

ATGTCCGAGA TGGCCTGAAG CCGGTACACC GTCGCGTACT TTACGCCATG 100 

AACGTATTGG GCAATGACTG GAACAAAGCC TACAAAAAAT CAGCCCGTGT 150 

CGTTGGTGAC GTGATCGGTA AATACCACCC GCACGGCGAC TCCGCGGTAT 200 
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ATGACACCAT CGTTCGTATG GCCCAGCCGT 
GTGGATGGCC AGGGGAACTT CGGTTCAATC 
AATGCGTTAT ACGGAAATCC GTCTGGCGAA 
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TCTCGCTGCG CTACATGCTG 250 
GACGGCGACT CCGCCGCGGC 3 00 

AATTGCTCAC GAACTGA 347 



2) INFORMATION FOR SEQ ID NO: 2279 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 362 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 43886 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2279 

AGAGCTCCTA TCTGGATTAT GCGATGTCGG TCATTGTTGG CCGTGCGCTG 50 

CCAGATGTCC GAGATGGCCT GAAGCCGGTA CACCGTCGCG TACTTTACGC 100 

CATGAACGTA CTAGGCAATG ACTGGAACAA AGCCTATAAA AAATCTGCCC 150 

GTGTCGTTGG TGACGTAATC GGTAAATACC ATCCCCATGG TGACTCGGCG 2 00 

GTCTATGACA CGATCGTCCG CATGGCGCAG CCATTCTCGC TGCGTTATAT 250 

GCTGGTAGAC GGTCAGGGTA ACTTCGGTTC TATCGACGGC GACTCTGCGG 300 

CGGCAATGCG TTATACGGAA ATCCGTCTGG CGAAAATTGC CCATGAACTG 350 

ATGGCCGATC TC 362 



2) INFORMATION FOR SEQ ID NO: 2280 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 358 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Escherichia coli 

(B) STRAIN: ATCC 25922 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2280 

CTCCTATCTG GATTATGCGA TGTCGGTCAT TGTTGGCCGT GCGCTGCCAG 50 

ATGTCCGAGA TGGCCTGAAG CCGGTACACC GTCGCGTACT TTACGCCATG 100 

AACGTACTAG GCAATGACTG GAACAAAGCC TATAAAAAAT CTGCCCGTGT 150 

CGTTGGTGAC GTAATCGGTA AATACCATCC CCATGGTGAC TCGGCGGTTT 200 

ATGACACGAT CGTCCGTATG GCGCAGCCAT TCTCGCTGCG TTACATGCTG 250 

GTAGACGGTC AGGGTAACTT CGGTTCCATC GACGGCGACT CTGCGGCGGC 3 00 

AATGCGTTAT ACGGAAATCC GTCTGGCGAA AATTGCCCAT GAACTGATGG 350 

CCGATCTC 358 
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2) INFORMATION FOR SEQ ID NO: 2281 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2281 
CCCCCAGCTG GGCGGCGGTA TCGATGGGGG 



2) INFORMATION FOR SEQ ID NO: 2282 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2282 
AGRRGCIMAR ATGTATGA 
2) INFORMATION FOR SEQ ID NO: 2283 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2283 
ATITATGAYG GKITTCAGAG GC 



2) INFORMATION FOR SEQ ID NO: 2284 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2284 
TCTGWGTRAC IGGYTCKGAG A 



2) INFORMATION FOR SEQ ID NO: 2285 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2285 
CMCCICCWGG TGGWGAWAC 



2) INFORMATION FOR SEQ ID NO: 2286 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2286 
AGTTGCTGTA TTAGGAAATG 



2) INFORMATION FOR SEQ ID NO: 2287 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2287 
TCGAAGTTGC TGTATTAGGA 



2) INFORMATION FOR SEQ ID NO: 2288 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1240 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: BM4339 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2288 

TAGAAGCTGG CTCGTTTTTT TATAAATAAG TTATTCGTTT ATTTTTGTTT 50 

TGTGCTAAAA TATGAGAGTA AATCACTGAA CGATTTAGAA TACAGGAGGA 100 

CAATCTTTTG AAGATTACTT TACTATATGG CGGACGCAGC GCAGAGCAGA 150 

GCATGAAGTG TCCATTCTTT CCGCATTTTC AGTTTTAAAT GCCATTTATT 200 

ATAATTATTA CCAAGTTCAA CTCGTATTTA TTACAAAAGA AGGACAATGG 250 

GTCAAAGGTC CATTACTAAC AGAAAAACCT GCCAGCAAAG ATGTCTTGCA 3 00 

TCTTTCATGG GACCCAAGTG GACAGACAGA GGAAGGCTTT ACAGGAAAAG 350 

TGATCAATCC GGGCGAAATC AAAGAAGAAG GAGCCATCGT TTTTCCAGTT 400 

TTACATGGGC CAAACGGGGA AGATGGAACG ATCCAAGGCT TCTTAGAGAC 450 

ATTGAATATG CCTTATGTCG GCGCAGGCGT ATTGACCAGT GCATGTGCCA 500 

TGGATAAAAT CATGACCAAG TATATTTTAC AAGCTGCTGG TGTGCCGCAA 550 

GTTCCTTATG TACCAGTACT T AAG AAT C AA TGGAAAGAAA ATCCTAAAAA 600 

AGTATTTGAT CAATGTGAAG GTTCTTTGCT TTATCCGATG TTTGTCAAAC 650 

CGGCGAATAT GGGTTCTAGT GTCGGCATTA CAAAAGCAGA AAACCGAGAA 700 

GAGCTGCAAA ATGCTTTAGC AACAGCCTAT CAGTATGATT CTCGAGCAAT 750 

CGTTGAACAA GGAATTGAAG CGCGCGAAAT CGAAGTTGCT GTATTAGGAA 800 

ATGAAGACGT TCGGACGACT TTGCCTGGTG AAGTCGTAAA AGACGTAGCA 850 

TTCTATGATT ATGAAGCAAA ATATATCAAT AATAAAATCG AAATGCAGAT 900 

TCCAGCCGAA GTGCCAGAAG AAGTTTATCA AAAAGCGCAA GAGTACGCGA 950 

AGTTAGCTTA CACGATGTTA GGTGGAAGCG GATTGAGCCG GTGCGATTTC 1000 

TTTTTGACAA ATAAAAATGA ATTATTCCTG AATGAATTAA ACTCTATGCC 1050 

AGGATTTACG GAGTTCAGTA TGTACCCACT CTTATGGGAA AATATGGGCT 1100 

TGAAATACGG TGATTTGATT GAAGAACTGA TCCAGTTAGG AATGAATCGA 1150 

TACCATCAGC GTCAATCTTT TTTTGAAAAA AATGAATAAA GAGAAATAAA 1200 

GAAGAGGCTG GAGTGATTGC GTAACCGCGT TCATTCTAGC 1240 



2) INFORMATION FOR SEQ ID NO: 2289 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) . TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2289 
CACCGAAGAA GATGAAAAAA 
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2) INFORMATION FOR SEQ ID NO: 2290 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS : Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2290 
TGGCACCGAA GAAGATGA 18 



2) INFORMATION FOR SEQ ID NO: 2291 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2291 
ATTTTGGCAC CGAAGAAGA 19 
2) INFORMATION FOR SEQ ID NO: 2292 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2292 
GAATCGGCAA GACAATATG 19 



2) INFORMATION FOR SEQ ID NO: 2293 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1032 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 
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(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecium 

(B) STRAIN: BM4147 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2293 

ATGAATAGAA TAAAAGTTGC AATACTGTTT GGGGGTTGCT CAGAGGAGCA 50 

TGACGTATCG GTAAAATCTG CAATAGAGAT AGCCGCTAAC ATTAATAAAG 100 

AAAAATACGA GCCGTTATAC ATTGGAATTA CGAAATCTGG TGTATGGAAA 150 

ATGTGCGAAA AACCTTGCGC GGAATGGGAA AACGACAATT GCTATTCAGC 200 

TGTACTCTCG CCGGATAAAA AAATGCACGG ATTACTTGTT AAAAAGAACC 250 

ATGAATATGA AATCAACCAT GTTGATGTAG CATTTTCAGC TTTGCATGGC 300 

AAGTCAGGTG AAGATGGATC CATACAAGGT CTGTTTGAAT TGTCCGGTAT 350 

CCCTTTTGTA GGCTGCGATA TTCAAAGCTC AGCAATTTGT ATGGACAAAT 400 

CGTTGACATA CATCGTTGCG AAAAATGCTG GGATAGCTAC TCCCGCCTTT 450 

TGGGTTATTA ATAAAGATGA TAGGC CGGTG GCAGCTACGT TTACCTATCC 500 

TGTTTTTGTT AAGCCGGCGC GTTCAGGCTC ATCCTTCGGT GTGAAAAAAG 550 

TCAATAGCGC GGACGAATTG GACTACGCAA TTGAATCGGC AAGACAATAT 600 

GACAGCAAAA TCTTAATTGA GCAGGCTGTT TCGGGCTGTG AGGTCGGTTG . 650 

TGCGGTATTG GGAAACAGTG CCGCGTTAGT TGTTGGCGAG GTGGACCAAA 700 

TCAGGCTGCA GTACGGAATC TTTCGTATTC ATCAGGAAGT CGAGCCGGAA 750 

AAAGGCTCTG AAAACGCAGT TATAACCGTT CCCGCAGACC TTTCAGCAGA 800 

GGAGCGAGGA CGGATACAGG AAACGGCAAA AAAAATATAT AAAGCGCTCG 850 

GCTGTAGAGG TCTAGCCCGT GTGGATATGT TTTTACAAGA TAACGGCCGC 900 

ATTGTACTGA ACGAAGTCAA TACTCTGCCC GGTTTCACGT CATACAGTCG 950 

TTATCCCCGT ATGATGGCCG CTGCAGGTAT TGCACTTCCC GAACTGATTG 1000 

ACCGCTTGAT CGTATTAGCG TTAAAGGGGT GA , 1032 



2) INFORMATION FOR SEQ ID NO: 2294 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2294 
AAACGAGGAT GATTTGATTG 



2) INFORMATION FOR SEQ ID NO: 2295 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2295 
TTGAGCAAGC GATTTCGG 



2) INFORMATION FOR SEQ ID NO: 2296 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1029 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Double 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: Genomic DNA 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Enterococcus faecalis 

(B) STRAIN: V583 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2296 

ATGAATAAAA TAAAAGTCGC AATTATCTTC GGCGGTTGCT CGGAGGAACA 50 

TGATGTGTCG GTAAAATCCG CAATAGAAAT TGCTGCGAAC ATTAATACTG 100 

AAAAATTCGA TCCGCACTAC ATCGGAATTA CAAAAAACGG CGTATGGAAG 150 

CTATGCAAGA AGCCATGTAC GGAATGGGAA GCCGATAGTC TCCCCGCCAT 200 

ATTCTCCCCG GATAGGAAAA CGCATGGTCT GCTTGTCATG AAAG AAAGAG 250 

AATACGAAAC TCGGCGTATT GACGTGGCTT TCCCGGTTTT GCATGGCAAA 300 

TGCGGGGAGG ATGGTGCGAT ACAGGGTCTG TTTGAATTGT CTGGTATCCC 3 50 

CTATGTAGGC TGCGATATTC AAAGCTCCGC AGCTTGCATG GACAAATCAC 4 00 

TGGCCTACAT TCTTACAAAA AATGCGGGCA TCGCCGTCCC CGAATTTCAA 450 

ATGATTGAAA AAGGTGACAA ACCGGAGGCG AGGACGCTTA CCTACCCTGT 500 

CTTTGTGAAG CCGGCACGGT CAGGTTCGTC CTTTGGCGTA ACCAAAGTAA 550 

ACAGTACGGA AGAACTAAAC GCTGCGATAG AAGCAGCAGG ACAATATGAT 600 

GGAAAAATCT TAATTGAGCA AGCGATTTCG GGCTGTGAGG TCGGCTGCGC 650 

GGTCATGGGA AACGAGGATG ATTTGATTGT CGGCGAAGTG GATCAAATCC 700 

GGTTGAGCCA CGGTATCTTC CGCATCCATC AGGAAAACGA GCCGGAAAAA 750 

GGCTCAGAGA ATGCGATGAT TATCGTTCCA GCAGACATTC CGGTCGAGGA 800 

ACGAAATCGG GTGCAAGAAA CGGCAAAGAA AGTATATCGG GTGCTTGGAT 850 

GCAGAGGGCT TGCTCGTGTT GATCTTTTTT TGCAGGAGGA TGGCGGCATC 900 

GTTCTAAACG AGGTCAATAC CCTGCCCGGT TTTACAT CGT ACAGCCGCTA 950 

TCCACGCATG GCGGCTGCCG CAGGAATCAC GCTTCCCGCA CTAATTGACA 1000 

GCCTGATTAC ATTGGCGATA GAGAGGTGA 1029 



2) INFORMATION FOR SEQ ID NO: 2297 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 bases 

(B) TYPE: Nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: Linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2297 
TTCAGGAGGG GGATCGC 
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